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DNA SEQUENCES CODING FOR MODIFIED 
FACTOR VIII :C AND MODIFIED FACTOR 
VIII:C-LIKE POLYPEPTIDES AND PROCESSES FOR 
PRODUCING THESE POLYPEPTIDES IN HIGH YIELDS 

TECHNICAL FIELD OF THE INVENTION 

This invention relates to DNA sequences 
coding for modified factor VIII:C-like polypeptides 
and processes for producing them using those DNA 
sequences. More particularly, this invention relates 
to the production of modified factor VIII :C and 
modified factor VIII :C-like polypeptides which 
display the biological activity of factor VIII :C. 
In addition, the polypeptides of this invention are 
produced in higher yields than previously produced 
factor VIII:C-like polypeptides and are more easily 
purified into biochemically pure mature factor 
VIII:C. 

BACKGROUND OF THE INVENTION 

Factor VIII :C, a large plasma glycoprotein, 
functions as the procoagulant component of factor VIII , 
which plays an integral role in the cascade mechanism 
of blood coagulation [see generally, W. J. Williams 
et al., Hematology , pp. 1085-90 , McGraw-Hill, New York 
(1972)]. Factor VIII :C circulates in the blood as a 
complex with factor VIIIR:Ag (also known as 
von Willebrand factor protein) which is a large 
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protein associated with platelet aggregation and 
adhesive properties ♦ 

Factor VIII:C is synthesized as a single 
r-fta-irr macromolecular precursor, which is later cleaved 
5 to yield the fragments which constitute "mature" 

factor VIII:C. Mature factor V1II:C is composed of 
two cha ins bridged by a calcium ion; an ami n onter min al 
heavy chain of 740 amino acids , and a carboxy- terminal 
light chain of 684 amino acids. The primary trans- 

10 lation product of factor VIII :C is a single chain 

in which the heavy chain of mature factor VIII:C is 
separated from the light chain by a "maturation 
polypeptide" of 908 amino acids. The excision of 
this maturation polypeptide is initiated by pro- 

15 teolytic cleavage of the primary translation product 
by an unknown or yet unidentified protease at the 
Arg 1648 - Glu 1649 peptide bond. The initial nick 
event begins a series of successive proteolytic 
cleavages which shorten the nascent heavy chai n from 

20 its carboxy terminus . Eventually the mature heavy 
chain of 740 amino acids results and in co m b in ation 
with the light chain of 684 amino acids, comprises 
mature factor VIII zC [see L.-O. Andersson et al. 
"Isolation and Characterization of Human Factor VIII: 

25 Molecular Forms In Commercial Factor VIII Concen- 
trate, Cryoprecipitate, and Plasma," PNAS(USA) , 83, 
pp. 2979-83 (1986)]. This complex is then activated 
by thrombin by cleavage at the Arg 1689-Ser 1690 
bond [D. Eaton et al., Biochemistry , 25, pp. 505-12 

30 (1986)]. 

Haemophilia A is a sex-linked hemorrhagic 
disease which is caused by a deficiency, either in 
amount or in biological activity, of factor VIII :C. 
The symptoms of acutely bleeding haemophilia patients 
35 are treated with factor VIII traditionally purified 
from normal sera. Various methods of purification 
have been described in the literature fsee, Zimmerman 
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et al., United States patent 4,361,509; Saundrey 
et al. United States patent 4,578,218; E.G.D. 
Tuddenhem et al., "The Properties of Factor VIII 
Coagulant Activity Prepared By Immunoadsorbent 
5 chromatography, Journal of Laborat ory Clinical 
Medicine , 93, pp. 40-53 (1979); D. E. G. Austen, 
"The Chromatographic Separation of Factor VIII on 
Aminohexyl Sepharose, " British J ournal of Hematology, 
43, pp. 669-74 (1979); M. Weinstein et al., "Analysis 

10 of Factor VIII Coagulant Antigen In Normal, Thrombin- 
treated, and Hemophilic Plasma," PNAS (USA), 78, 
pp. 5137-41 (1981); P. J. Fay et al., "Purification 
And Characterization Of A Highly Purified Human Factor 
VIII Consisting Of A Single Type Of Polypeptide 

15 Chain," PNAS (USA), 79, pp. 7200-04 (1982); C. A. 
Fulcher and T. S. Zimmerman, "Characterization Of 
The Human Factor VIII Procoagulant Protein With A 
Heterologous Precipitating Antibody, " PNAS (USA), 
79, pp. 1648-52 (1982); F. Rotblat et al., Thromb. 

20 Haemostasis , 50, p. 108 (1983); C. A. Fulcher et al., 
Blood , 61, pp. 807-11 (1983)]. 

However, purification has proven to be 
difficult because of the relatively low concentration 
of factor VIII :C in serum, its tight association 

25 with the larger factor VIIIR:Ag and its sensitivity 
to degradation by serum proteases. Factor VIII sC 
when purified from plasma thus contains a heterogen- 
eous mixture of heavy chains ranging in length from 
1648 a min o acids down to 740 amino acids which result 

30 from these numerous proteolytic events [Andersson 
et al., supra , p. 2983]. The heterogenous mixture 
of chains observed in plasma-purified factor VIII :C, 
has made recovery of a substantially pure mature 
factor VIII :C almost impossible. Furthermore, tradi- 

35 tional treatment of haemophilia with factor VIII 

purified from plasma has serious drawbacks. Specifi- 
cally, it can lead to the unintended transfer of the 
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causative agents of hepatitis or the virus associated 
with Acquired Immune Deficiency Syndrome. 

In view of its importance in the treatment 
of haemophilia, numerous attempts have been made to 
5 produce large quantities of factor VIII :C using 

recombinant DNA technology [See , for example. Genetics 
Institute, PCT application W085/01961; Genentech 
European Patent application 160,457; Chiron European 
Patent application 150,735; J. J. Toole et al., 
10 "Molecular Cloning Of a cDNA Encoding Human Antihae- 
mophilic Factor 11 Nature, 312, pp. 342-47 (1984); and 
W. I. Wood et al.. Nature, 312, pp. 330-37* (1984) ] . 
However, such attempts have proven to be less 
successful than had been hoped. This is partially 
15 due to the fact that the recombinantly produced 2332 
amino acid factor VIII ;C chain is subject to proteo- 
lytic cleavage at many positions. It is also due to 
difficulties in producing recombinant factor VIII :C 
in sufficiently high yields. 

SUMMARY OF THE INVENTION 

The present invention solves the problems 
referred to above by providing DNA sequences which 
encode modified factor VIII :C and modified factor 
VIII:C-like polypeptides. These DNA sequences code 
for polypeptides which are produced in approximately 
twenty-times higher yields than previous recombinantly 
produced factor VIII :C and are more easily purified 
into biochemically pure mature factor VIII :C. 

According the present invention,. DNA 
sequences coding for modified factor VIII :C are pro- 
duced and expressed in high yields. As will be appar- 
ent from the disclosure and examples to follow, the 
modified factor VIII:C and modified factor VIII:C-like 
polypeptides of this invention are characterized by 
deletions removing a major part of the maturation 
polypeptide of factor VIII :C. The DNA sequences in 
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our preferred embodiment have a deletion of substan- 
tially all of the nucleotides coding for the matura- 
tion polypeptide. Our most preferred embodiment 
contains a deletion of all the DNA sequence coding 
5 for the maturation polypeptide. On expression of 
our DNA sequences, the heavy chain of mature factor 
VIII :C is linked directly to the light chain. Follow- 
ing a one-nick proteolytic event, the mature form of 
factor VIII :C is generated. 

1Q Finally, the present invention provides 

various anti-haemophilic compositions containing 
modified factor VIII :C and modified factor VIII :C- 
like polypeptides produced by the DNA sequences 
of this invention, and various methods of using 

15 those compositions in haemophilia treatment- 
therapy of acute or prolonged bleeding in 
haemophilia A. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depigts a restriction map of the 

20 factor VIII :C cDNA. 

Figure 2 is a schematic depiction of the 
construction of the recombinant DNA molecule with 
the QD deletion. 

Figures 3A and 3B depict a schematic repre- 
25 sentation of the construction of the recombinant DNA 
molecule with the RE deletion. 

Figure 4 depicts a restriction endonuclease 
map of the RE deletion inserted into the mammalian 
cell expression vector pBG3l2 indicating the positions 
30 of the SV40 origin of replication/enhancer, the adeno- 
virus major late promoter, the factor VIII :C cDNA 
with the RE deletion, the 3' untranslated region of 
the factor VIII :C mRNA, and the polyadenylation site. 
Figure 5 depicts the results of an SI 
35 analysis of Factor VIII :C mRNA isolated from trans- 
fected BMT10 cells. 
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Figure 6 depicts the results of a Southern 
analysis of plasmid DNA isolated from transfected 
BMT10 cells. 

Figure 7 depicts the published DNA and 
5 amino acid sequence of factor VIII :C (EPO application 
160,457). 

DETAILED DESCRIPTION OF THE INVENTION 

In order that the invention herein described 
may be more fully understood / the following detailed 
10 description is set forth. 

In the description the following terms are 

employed: 

Nucleotide~ A monomeric unit of DNA or RNA 
consisting of a sugar moiety (pentose) a phosphate, 

15 and a nitrogenous heterocyclic base. The base is 

linked to the sugar moiety via the glycosidic carbon 
(l 1 carbon of the pentose) and that combination of 
base and sugar is called a nucleoside. The base 
characterizes the nucleotide. The four DNA bases 

2a are adenine ("A"), guanine ("G"), cytosine ("C"), 

and thymine ("T"). The four RNA bases are A r G, C, 
and uracil ("U"). 

DNA Sequence— A linear array of nucleotides 
connected one to the other by phosphodiester bonds 

25 between the 3 f and 5 1 carbons of adjacent pentoses. 

Codon — A DNA sequence of three nucleotides 
(a triplet) which encodes through mRNA an amino acid, 
a translation start signal or a translation termina- 
tion signal. For example, the nucleotide triplets 

30 TTA, TTG, CTT, CTC, CTA and CTG encode for the amino 
acid leucine ("Leu"), TAG, TAA and TGA are transla- 
tion stop signals and ATG is a translation start 
signal. 

Amino Acid — A monomeric unit of a peptide, 
35 polypeptide or protein. The twenty amino acids are: 
phenylalanine ("Phe" or "F"), leucine ("Leu", "L"), 
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isoleucine ("lie", "I"), methionine ("Met", "M" ) , 
valine ("Val", "V"), serine ("Ser", "S"), proline 
("Pro", "P"), threonine ("Thr", "T"), alanine 
("Ala", "A"), tyrosine ("Tyr", "Y" ) , histidine 
5 ("His", "H"), glutamine ("Gin", "Q"), asparagine 
( "Asn:N" ) , lysine ("Lys:K"), aspartic acid ("Asp", 
"D"), glutamic acid ("Glu", "E"), cysteine ("Cys", 
"C"), tryptophane ("Trp", "W"), arginine ("Arg", 
"R") and glycine ("Gly", "G"). 
10 Reading Frame — The grouping of codons during 

the translation of mRNA into amino acid sequences. 
During translation the proper reading frame must be 
maintained. For example, the DNA sequence 
GCTGGTTGTAAG may be expressed in three reading frames 
or phases, each of which affords a different amino 
acid sequence: 

GCT GGT TGT AAG — Ala- Gly- Cys -Lys G 
CTG GTT GTA AG — Leu-Val-Val GC TGG 
TTG.TAA G — Trp - Leu- ( STOP ) 

Polypeptide — A linear array of amino acids 
connected one to the other by peptide bonds between 
the a -amino and carboxy groups of adjacent amino 
acids. 

Genome — The entire DNA of a cell or a virus. 
It includes inter alia the structural gene coding 
for the polypeptides of the substance, as well as 
operator, promoter and ribosome binding and interac- 
tion sequences, including sequences such as the Shine- 
Dalgamo sequences. 

Gene — A DNA sequence which encodes through 
its template or messenger RNA ("mRNA") a sequence of 
amino acids characteristic of a specific polypeptide. 

Trans crip ti on — The process of producing 
mRNA from a gene or DNA sequence. 

Translation — The process of producing a 
polypeptide from mRNA. 
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Expression— The process undergone by a 
gene or DNA sequence to produce a polypeptide- It 
is a combination of transcription and translation. 

Plasmid — A nonchromosomal double- stranded 
5 DNA sequence comprising an intact "replicon" such 

that the plasmid is replicated in a host cell. When 
the plasmid is placed within a unicellular organ ism, 
the characteristics of that organism may be changed 
or transformed as a result of the DNA of the plasmid. 

10 For example, a plasmid carrying the gene for tetra- 
cycline resistance (TET**') transforms a cell previously 
sensitive to tetracycline into one which is resistant 
to it. A cell transformed by a plasmid is called a 
"transformant" . 

15 Phage or Bacteriophage — Bacterial virus, 

many of which consist of DNA sequences encapsidated 
in a protein envelope or coat ("capsid"). 

Cloning Vehicle— A. plasmid, phage DNA, 
cosmid or other DNA sequence which is able to repli- 

20 cate in a host cell, characterized by one or a small 
number of endonuclease recognition sites at which 
such DNA sequences may be cut in a determinable 
fashion without attendant loss of an essential bio- 
logical function of the DNA, e.g., replication, pro- 

25 duction of coat proteins or loss of promoter or 

binding sites, and which contains a marker suitable 
for use in the identification of transformed cells, 
e.g., tetracycline resistance or ampicillin resist- 
ance. A cloning vehicle is often called a vector. 

30 Cloning — The process of obtaining a popu- 

lation of organisms or DNA sequences derived from 
one such organism or sequence by asexual reproduction. 

Recombinant DNA Molecule or Hybrid DNA — A 
molecule consisting of segments of DNA irom different 

35 genomes which have been joined end-to-end outside of 
living cells and able to be maintained in living 
cells . 
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Mature Factor VTIItC — As used in this 
application, mature factor VIII :C is composed of 
an N- terminal heavy chain (Ala 1- Arg 74Q) linked 
to a C-terminal light chain (Glu 1649-Tyr 2332) 
5 through an alkaline metal bridge, such as calcium 
(Figure 7). 

Modified Factor VIII :C — As used in this 
application, "modified factor VIII :C" refers to poly- 
peptides characterized by a deletion of a major por- 

10 tion of the maturation polypeptide of factor VIII :C. 
For example, where the entire maturation polypeptide 
has been deleted, "modified factor VIII :C U includes 
proteins that comprise the N- terminal mature heavy 
chain and the C- terminal mature light chain of factor 

15 VIII :C linked together as a single chain. 

Modified Factor VIII;C-Like Polypeptide — 
As used in this application, "modified factor VIII rC- 
like polypeptide" includes proteins having the bio- 
logical activity of modified factor VIII iC. It 

20 also includes proteins having an amino terminal 

methionine, e.g., f-Met- factor VIII :C, and proteins 
that are characterized by other amino acid deletions , 
additions or substitutions so long as those proteins 
substantially retain the biological activity of 

25 modified factor VIII :C. 

"Modified factor VIII:C-like polypeptides" 
within the above-definition also includes -natural 
allelic variations that may exist and occur from 
individual to individual. Furthermore, it includes 

30 modified factor VIII:C-like polypeptides whose 
degree and location of glycosylation, or other 
post- translation modifications, may vary depending 
on the cellular environment of the producing host 
or tissue. 
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The present invention relates to processes 
for the production of modified factor VIII :C and 
modified factor VIII:C-like polypeptides. More 
particularly, it provides DNA sequences whiclr. permit 
the production of modified factor VIir:C and: modi- 
fied factor VIII:C-like polypeptides iiL. higtr: ydelds r 
in appropriate hosts. Polypeptides prndncedklxj. tixe . . 
DNA sequences of this invention are- useftxLritt.thesv 
clinical treatment of haemophilia A- • 

As compared to factor VTIXrC;. tftev modi fled 
factor VIII :C produced by the DNA sequences- of this 
invention lack a major portion of .' tftftv matura tion 
polypeptide of factor VIII :C. The DNA sequences 
of the present invention surprisingly express 
modified factor VIII :C in much higher yields than 
DNA sequences coding for factor VIII :C itself. 

While not wishing to be bound by theory, 
we believe that the DNA sequences of the present 
invention produce modified factor VIII :C in high 
yields- because of the absence of most or all of 
the maturation polypeptide. For example, the 
mRNA for the modified gene may be translated more 
efficiently, because the RNA coding for the long 
maturation polypeptide does not have to be trans- 
lated. In 'addition, while factor VIII :C has many 
proteolytic targets which may be attacked while the 
polypeptide is in the cell, the modified factor 
VIII :C is less subject to such proteolytic attack 
because it lacks the proteolytic targets within 
the maturation polypeptide. Furthermore, when the 
maturation polypeptide is absent, 19 of the 25 
N-linked glycosylation sites of native factor VIII :C 
are deleted, leaving only six N-liked glycosylation 
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sites on the modified polypeptide (three on the heavy 
chain and three on the light chain) . Apparently, 
because there are fewer sites to be glycosylated, 
production and purification of the modified factor 
5 VIII:C is simplified. 

In the processes of this invention, we 
modify the DNA sequence encoding factor VIII :C to 
delete from it a major portion of the DNA sequence 
encoding the maturation polypeptide. Having pre- 

10 pared a DNA sequence carrying the desired deletion 
we employ it in a variety of expression vectors and 
hosts to produce modified factor VIII :C encoded by 
it. For example, any of a wide variety of expression 
vectors are useful in expressing the modified factor 

15 VIII:C coding sequences of this invention. It also 
should be understood that DNA sequences encoding a 
modified factor VTIIrC-like polypeptide can be 
similarly produced in accordance with this invention. 
Useful expression vectors include, for 

20 example, vectors consisting of segments of 

chromosomal, non-chromosomal and synthetic DNA 
sequences, such as various known derivatives of 
SV40, known bacterial plasmids^ e.g., plasmids from 
E . coli including col El, pCRI, pBR322, pMB9 and 

25 their derivatives, wider host range plasmids, e.g., 
RP4, phage DNAs, e.g., the numerous derivatives of 
phage A., e.g., NM 989, and other DNA phages, e.g., 
M13 and Filament eous single stranded DNA phages, 
yeast plasmids such as the 2\i plasmid or derivatives 

30 thereof, and vectors derived from combinations of 

plasmids and phage DNAs, such as plasmids which have 
been modified to employ phage DNA or other expression 
control sequences. In the preferred embodiments of 
this invention, we employ pBG312, a pBR327-related 

35 vector [R. Cate et al., Cell , 45, pp. 685-98 (1986)1. 

In addition, any of a wide variety of 
expression control sequences — sequences that con- 
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trol the expression of a DNA sequence when operatively 
linked to it — may be used in these vectors to 
express the DNA sequence of this invention. Such 
useful expression control sequences, include, for 
5 example, the early and late promoters of SV40, the 
lac system, the tr£ system, the TAC or TRC system, 
the major operator and promoter regions of phage X, 
the control regions of fd coat protein, the promoter 
for 3-phosphoglycerate kinase or other glycolytic 

10 enzymes, the promoters of acid phosphatase, e.g., 
Pho5, the promoters of the yeast or -mating factors, 
and other sequences known to control the expression 
of genes of prokaryotic or eukaryotic cells or their 
viruses, and various combinations thereof. In the 

15 preferred embodiment of this invention, we employ 

adenovirus-2 major late promoter expression control 
sequences. 

A wide variety of host cells are also useful 
in producing the modified factor VIII :C of this inven- 

20 tion. These hosts may include well known eukaryotic 
and prokaryotic hosts, such as strains of E.coli, 
Pseudomonas , Bacillus , Streptomyces , fungi such as 
yeasts, and animal cells, such as CHO cells, African 
green monkey cells, such as COS1, COS7, BSC1, BSC40, 

25 and BMT10, and human cells and plant cells in tissue 
culture. In the preferred embodiments of this inven- 
tion, we prefer BMT10 African green monkey cells. 

It should of course be understood that not 
all vectors and expression control sequences will 

30 function equally well to express the modified DNA 

sequences of this invention and to produce our modi- 
fied factor VIII :C. Neither will all hosts function 
equally well with the same expression system. How- 
ever, one of skill in the art may make a selection 

35 among these vectors, expression control sequences 
and hosts without undue experimentation without 
departing from the scope of this invention. For 
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of this invention are useful in composition and 
methods for treatment of haemophilia A and in a 
variety of agents useful "in treating uncontrolled 
bleeding. 

5 While the modified factor VIII :C and 

modified factor VIII:C-like polypeptides of this 
invention may be administered in such compositions 
and methods in the form in which they are produced, 
as single chain polypeptides, it should also be under- 

10 stood that it is within the scope of this invention 
to administer the modified factor VIII :C after sub- 
jecting it to proteolytic cleavage* For example, 
modified factor VIII :C can be cleaved in vitro, into 
the heavy chain and light chain of mature factor 

15 VIII :C and linked with a calcuim or other alkaline 
metal bridge, before, during or after purification. 

The modified factor VIII :C and modified 
factor VIII:C-like polypeptides of this ' invention 
may be formulated using known methods to prepare 

20 pharmaceutically useful compositions. Such composi- 
tions also will preferably include conventional 
pharmaceutically acceptable carriers and may include 
other medicinal agents, carriers, adjuvants, 
excipients, etc., e.g., human serum albumin or plasma 

25 preparations. See, e.g., Remington's Pharmaceutical 
Sciences (E. W. Martin). The resulting formulations 
will contain ah amount of modified factor VIII :C 
effective in the recipient to treat uncontrolled 
bleeding. Administration of these polypeptides, or 

30 pharmaceutically acceptable derivatives thereof, may 
be via any of the conventional accepted modes of 
administration of factor VIII. These include paren- 
teral, subcutaneous, or intravenous administration. 

The compositions of this invention used in 

35 the therapy of haemophilia may also be in a variety 

of forms. The preferred form depends on the intended 
mode of administration and therapeutic application. 
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The dosage and dose rate will depend on a variety of 
factors for example, whether the treatment is given 
to an acutely bleeding patient or as a prophylactic 
treatment. However, the factor VIII :C level should 
5 be high enough to prevent hemorrhage and promote 
epithelialization [see discussion . in Williaps, 
Hematology , pp. 1335-43]. 

In order that this invention may be better 
understood, the following example is set forth. 
10 This example is for purposes of illustration only 
. and is not to be construed as limiting the scope of 
the invention. 

EXAMPLE 

We have constructed cDNA sequences which 
15 encode modified factor VIII :C molecules having a 

deletion of a major part of or all of the maturation 
polypeptide. To test the limits of our invention, 
we also constructed a cDNA sequence which encodes a 
polypeptide having a deletion of more than just the 
20 maturation polypeptide of factor VIII :C. 

A. ASSEMBLY OF THE FULL-LENGTH 
FACTOR VIII :C cDNA 

Referring now to Figure 1, we have presented 
therein a restriction enzyme map of the' factor VIII rC 

25 cDNA based upon the published sequence [W. I . Wood 

et al., Nature , 312, pp. 330-37 (1984); (Figure 7)}. 
The bar represents the coding sequence. Below the 
restriction enzyme map we have depicited the amino- 
terminal heavy chain of mature factor VIII :C attached 

30 by a calcium bridge to the carboxy-terminal light 
chain of mature factor VIII rC. Below the protein 
model on a bar congruent to the restriction enzyme 
map we have indicated the oligonucleotide probes 
(indicated with asterisks) which we used to screen 

35 human placenta, liver, and kidney cDNA libraries. 
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These libraries w re made using oiigo (dT) as first- 
strand primer and \gtl0 as vector. 

On this second bar are also located the 
oligonucleotide primers (left-arrows) which we used 
5 to initiate first-strand cDNA synthesis using human 
kidney mRMA as template. We made these single- 
stranded cDNA sequences double- stranded by the 
technique of Gubler and Hoffman [U. Gubler, and 
B. J. Hoffman, Gene , 25, pp. 263-69 (1983)]. We 

10 cloned them at the dC- tailed EcoRV site in pBR322. 
We then screened this plasmid-based kidney cDNA 
library with oligonucleotide probes located on 
the bar 5' to the oligonucleotide primers. 

Below the primer/probe bar in Figure l, we 

15 have displayed a collection of partial-length factor 
VIII :C cDNA and genomic subclones, which we isolated 
from these libraries. Together these encode the 
full-length cDNA gene. More information about these 
clones is presented below, in Table 1. 
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TABLE 1 



COMPENDIUM OF FACTOR VIII : C 
GENOMIC AND CDNA CLONES 

isolated from a genomic library constructed with 
cosmid pTCF [Grosveld, F.G. et. al.. Nucleic 
Acids Research, 10. dp. 6715-6732 (1982)1 



subclone length tissue- 

pUC19.2874 2874 bp 48,XXXX 

human lymphoblast 

10 isolated from olioo(dT) -primed XgtiO cDNA libraries 

clone length probe hybridization 

1.7977 (placenta) 1728 79+, 77+ 

2.73 (liver) -700 73+ 

4.73 (kidney) -220 73+ 

15 isolated from a 85, 86-primed Gubler-Hoffman 
kidney. cDNA library 

clone length probe hybridization 



20 



25 

13. 7< 

isolated from a 75, 77-primed Gubler-Hoffman 
kidney cDNA library 

clone length probe hybridization 

30 7.7475 -2700 74+, 75+ 



1.82 


-1200 


82+, 


79-, 


77- 




2.82 


-1200 


82+, 


79-, 
75+, 


77- 




3.7573 


-2700 


74-, 


73+ 




4.7573 


-2700 


74-, 


75+, 


73+ 




6.7573 


-2700 


74-, 


75+, 


73+ 




10.797783 


>1263 


82-, 


79+, 


77+, 


83+ 


11 . 797783 


>1263 


82-, 


79+, 


77+, 


83+ 


12.797783 


>1263 


82-, 


79+, 


77+, 


83+ 


13.797783 


>1263 


82-, 


79+, 


77+, 


83+ 



10 
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Prior to assembling the full-length cDNA 
gene, we constructed two intermediate plasmids. 
This was necessary because of the excessive length 
of the factor VIII :C cDNA. For our first preliminary 
construction we isolated a fragment from clone 7.7475 
extending from the Pstl site at 5163 to the Pstl 
site at 5755. We inserted this fragment into clone 
4.7573 at the Pst l site at 5755 thereby extending 
clone 4.7573. This Pst l site is shared by the inserts 
of both clone 7.7475 and clone 4.7573. By extending 
clone 4.7573 in this manner, we provided a unique 
Ndel site at 5522 in the insert of this derivative 
of clone 4.7573. We needed to create this Nde site 
because we needed a unique site at which to extend 
15 the length of this insert at its 5' end. 

As a second preliminary construction we 
introduced a polynucleotide linker in clone 2.82 at 
a location immediately 5' to the translation start 
codon of the signal sequence of factor VIII :C. The 
20 insert of clone 2.82 is at the EcoRV site of pBR322 
and its orientation is opposite to that of tetra- 
cycline resistance. The 5 1 endpoint of the insert in 
clone 2.82 is at -133 in the 5' untranslated leader 
sequence. We cleaved clone 2.82 at the Sai l site in 
25 tetracycline resistance and at the Sac I site in the 
sequence encoding the signal peptide in the insert 
of clone 2.82 and inserted the synthetic duplex 

BSGNSS 
APISAS 
N1APCT 
221211 

///// 

GTCGACTCGCGACCATGGATGCAAATAGAGCTC 

1 33 



30 



35 



H 






SAI 


N 


N 


ACN 


R 


C 


LCC 


U 


0 


112 


1 


1 



CAGCTGAGCGCTGGTACCTACGTTTATCTCGAG 

MetGlnlleGluLeu 

This ligation resulted in the introduction of a 
Sall-Nrul-Ncol polylinker immediately 5' to the start 
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codon which initiates translation of the signal 
sequence of factor VIII :C. These three restriction 
enzymes do not cleave the full-length factor VIII :C 
cDNA gene* 

5 With these two intermediate constructions 

available, we assembled the full-length factor VIII :C 
cDNA in a six-fragment ligation reaction (bottom 
Figure 1) . It was necessary to create the full length 
DNA in this manner because we never isolated the 

10 full DNA in one single clone. We isolated, fragment 
1 from the above-described derivative of clone 2.82. 
Fragment 1 extended from Sai l in the polylinker to 
Ava l at 731. We isolated Fragment 2 from the insert 
in the XgtlO recombinant 1.7977. Fragment 2 extended 

15 from Ava l at 731 to EcoR I at 2289. Fragment 3 derived 
from the subclone pUC19.2874 of a genomic cosmid. 
recombinant; it extended from EcoR I at 2289 to BamH I 
at 4743. Fragment 4 was isolated from clone 7.74575, 
starting from the BamBI site at 4743 and extending 

20 to the Nde l site at 5522. We isolated fragment 5 

from the above-described derivative of clone 4.7573. 
Fragment 5 extended from Ndel at 5522 to Ncol at 
7991. Fragment 6 is an assembly vector containing 
an E.coli replication origin and selectable marker 

25 for ampicillin resistance. We isolated Fragment 6 
from pAT.SV2.tPA, a gift from Richard Fisher. This 
is a plasmid in which the transcription of the tPA 
gene is under the control of the SV40 early promoter. 
We digested pAT.SV2.tPA with Sai l, which cleaves within 

30 the tetracycline resistance marker , and with Nco l 
which cleaves within the SV40 early region. 

Of the 96 recombinants we analyzed, 32 
contained all five factor VIII :C restriction frag- 
ments. We determined the DNA sequence of one of 

35 these clones , and we identified two changes with 

respect to the published sequence. One is a CTG to 



*3 



>;<> 
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VIII :C» This clone/ which we named signal-minus, 
provided a negative control for our transient 
expression assays, described below. 

C. CONSTRUCTION OF GLN 744 - ASP 1563 
5 ( ABBREVIATED QD) DELETION 

In this section we demonstrate how we created 
the QD deletion which removes a portion of DNA sequence 
coding on expression for the maturation polypeptide 
(amino acids 741-1648). The QD deletion retains 

10 approximately 90 amino acids of the maturation poly- 
peptide (four amino acides at the N- terminal end of 
the maturation polypeptide and 86 amino acids at its 
carboxy terminal end).. 

Referring now to Figure 2, we depict therein 

15 the construction of the QD deletion. We partially 
digested one aliquot of the expression plasmid for 
the full-length factor VIII :C gene with EcoRI. This 
endonuclease cleaves between the codons for Gin 744 
and Asn 745. We removed the 5 f AATT overhang with 

20 nuclease Si, and then subjected the plasmid to com- 
plete digestion with Pvu l within the ampicillin 
resistance gene. We partially digested another 
aliquot with BamH l, which cleaves between the codons 
for Leu 1562 and Asp 1563 (see Figure 7). We filled 

25 out the 5'GATC overhang with the Klenow fragment, 

and again digested the plasmid with Pvul within amp. 
We then combined the two mixtures of fragments and 
ligated them with T4 DNA ligase. A BamR l site between 
the codons for Gin 744 and Asp 1563 was created in 

30 this fusion. 

The modified polypeptide produced on expres- 
sion as a result of the QD deletion lacks 818 amino 
acids from within the 908 amino-acid maturation poly- 
peptide, leaving 4 amino acids C-terminal to the 

35 carboxy terminus of the mature heavy chain, Arg 740, 
and leaving 86 amino acids N- terminal to the amino 
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terminus of the light chain, Glu 1649 (Figure 7). 
The 908 amino-acid maturation polypeptide is thus, 
replaced by a 90- amino-acid maturation polypeptide , 
with the protease substates for both initial 
5 maturation of the primary translation polypeptide 
and subsequent maturation of the heavy chain 
remaining intact. 

D. CONSTRUCTION OF THE ARG 740 - GLU 1649 
(ABBREVIATED RE) DELETION 

10 We demonstrate in this section how we 

created the RE deletion, which removes the entire 
DNA sequence coding for the maturation polypeptide. 

Referring now to Figures 3 A and B, we show 
how we obtained this RE deletion fusion in two steps. 

15 In the first step we ligated four fragments which 
resulted in an intermediate plasmid. These four 
fragments were: 

(1) the 462 bp fragment, obtained by 
digesting the expression plasmid for the full-length 

20 gene with Hin di 1 1 between the codons for Arg 740 and 
Ser 741, removing the 5' AGCT with nuclease Si, and 
subsequently digesting with Kpn l which cleaves 
uniquely between the codons for Tyr 586 and Leu 587. 

(2) the synthetic oligonucleotide duplex 

25 fragment 

5 f pGAA ATA ACT CGT ACT ACT CTT CAG TCA 

CTT TAT TGA GCA TGA TGA GAA GTC AGT CTA Gp 5 ? 
Glu lie Thr Arg Thr Thr Leu Gin Ser Asp 
1649 1657 
30 (3) the 135 bp fragment obtained by digest- 

ing the expression plasmid for the full-length gene 
first with Sau3A; we isolated the 411 bp fragment 
which resulted from Sau3A digestion between the codons 
for Ser 1657 and Asp 1658 and between the codons for 
35 Glu 1794 and Asp 1795. Then, we digested the 411 bp 
fragment with PstI which cleaves between the codons 
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.for Ala 1702 and Val 1703, to obtain the 135 bp 
5 r fragment. 

(4) pUC18 digested with Kpn l and Pstl. 

We then isolated a fragment encoding the 
5 RE fusion from this intermediate plasmid. To do 

this, we digested the intermediate plasmid generated 
in the four- fragment ligation with Asp7 18 and Pstl. 
The fragment encoding the RE fusion was used to 
replace the corresponding fragment in the expression 

10 plasmid for the QD fusion. We ligated the resultant 
624 bp fragment encoding the RE fusion to the mixture 
of fragments which we obtained by first completely 
digesting the expression plasmid for the QD internal 
deletion at the unique Asp 718 site, next dephos- 

15 phorylating the 5* GTAC overhang with calf intestinal 
phosphatase, and then partially digesting the plasmid 
with Pst l. 

Referring now to Figure 4, we depict therein 
a map of the RE deletion inserted into pBG312. - In 

20 the modified polypeptide produced on expression the - 
908 amino-acid maturation polypeptide is entirely 
removed. The novel polypeptide produced by this 
recombinant molecule cell is secreted, and may be 
purified as a single chain, i.e., the heavy chain is 

25 linked directly to the light chain. Because the Arg 

1648 - Glu 1649 peptide bond which is normally cleaved 
during the initial nicking of the full-length primary 
translation product is preserved in this deletion, 
the primary translation product for this internal 

30 deletion is nicked by the same protease that initiates 
nicking of the full-length primary translation product, 
thus producing directly the mature form of the heavy 
chain of factor VI 1 1 : C . Our Western blot analysis 
(data not shown) confirms that the RE modified factor 

35 VIII :C encodes a single chain molecule which is then 
processed into a 9 OK heavy chain and an 8 OK light 
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chain in the culture medium. The resultant light 
chain possesses the peptide from Glu 1649 to Arg 
1689 that binds the two-chain complex to von 
Willebrand protein* For this reason, this recombinant 
5 product, when secreted from a mammalian cell, will 
bind to the von Willebrand protein present in cell 
culture fluid. Similarly/ when injected, it will 
complex to and circulate with plasma von Willebrand 
protein. Upon thrombin cleavage at Arg 1689 - Ser 
10 1690, the two-chain mature factor VIII :C will be 
activated and will dissociate from von Willebrand 
protein and assemble into its ternary complex with 
factor IXa and factor X on a platelet surface. 

E. CONSTRUCTION OF THE ARG 740 - SER 1690 
15 (ABBREVIATED RS ) DELETION 

In order to test the outer limits, of these 
deletions, ye constructed a plasmid which codes for 
a polypeptide with a deletion of more than the 
maturation polypeptide alone (i.e., we deleted the 

20 DNA sequence which codes on expression for the forty- 
one amino acids at the N-tenninal end of the light 
chain of mature factor VIII :C). 

We constructed this RS fusion with the 
two-step strategy described above for the RE fusion. 

25 Our first step was a three- fragment ligation resulting 
in an intermediate plasmid. The three fragments 
which we ligated were: 

(1) the 462 bp fragment, obtained by digesting 
the expression plasmid for the full-length gene with 

30 Hin di I I between the codons for Arg 740 and Ser 741, 

removing the 5 1 AGCT with nuclease Si, and subsequently 
digesting with Kpn l which cleaves uniquely between 
the codons for Tyr 586 and Leu 587. 

(2) the synthetic oligonucleotide duplex 
35 fragment: 
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5' pAGC TTT CAA AAG AAA ACA CGA CAC TAT TTT ATT GCT GCA 
TCG AAA GTT TTC TTT TGT GCT GTG ATA AAA TAA CGp 5 1 
Ser Phe Gin Lys Lys Thr Arg His Tyr Phe He Ala Ala 
1690 1702 
5 (3) pUC18 digested with Kpn l and Pstl. 

In tlxis fusion, we recreated the Hind i 1 1 site between 
the codons for Arg 740 and Ser 741 (now Ser 1690). 

We isolated a fragment encoding the RS 
fusion from this intermediate plasmid and used this 

10 fragment in our second step to replace the corre- 
sponding fragment in the expression plasmid for the 
QD fusion- In this second step, we isolated a 501 bp 
fragment encoding the RS fusion. We digested the 
intermediate plasmid with Asp 718 and Pst l and isolated 

15 the fragment encoding the RS fusion. We then used 
t±Le strategy described above for the RE fusion to 
replace the related fragment in the expression plasmid 
for the QD fusion with the 501 bp fragment. 

In addition to removing the entire 

20 maturation polypeptide, the RS deletion removes DNA 

coding for the Glu 1649 - Arg 1689 peptide, the puta- 
tive von Will ebr and binding domain. For this reason 
this recombinant molecule will not attach to circu- 
lating von Willebrand protein when it is secreted 

25 from an animal cell into culture fluid or when it is 
injected into a recipient. 

F. TRANS FECTI ON OF AFRICAN GREEN 
MONKEY KIDNEY CELLS 

We trans fected BMT10 cells [R. D. Gerard 
30 and Y. Gluzman, Mol. Cell. Biol. , 5, pp. 3231-40 

(1985)] with the supercoiled expression plasmid. We 
used the DEAE-dextran technique [L. M. Sompayvac and 
K. J. Danna, PNAS , 78 r pp. 7575-78 (1981)] and 
chloroguine [H. Luthman and G. Magnusson, Nucleic 
35 Acids Research, 11, pp. 1295-1308 (1983)] to trans- 
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fect the cells. Transfectants are known to replicate 
the input expression plasmid to high copy number 
because SV40 T antigen is inducibly supplied in trans 
by BMT10 cells and binds to the SV40 origin of repli- 
5 cation linked to the modified factor VIII :C gene in 
the expression plasmids. However, this technique is 
inefficient because, typically, only several percent 
of the transfected cells will actually incorporate DNA. 
The transfectants will secrete modified 
10 factor VIII :C for up to 120 hours. For most experi- 
ments, the cm 2 /ml ratio is approximately 5.5; that 
is, a confluent monolayer of BMT10 transfectants in 
a 100 mm Petri dish (55 cm 2 ) is covered with 10 ml 
culture fluid. 

15 G. FACTOR VIII :C ACTIV ITY ASSAY 

We assayed the signal -minus, 8.1, QD, RE 
and RS expression constructs for factor VIII :C produc- 
tion after transfection in duplicate into BMT10 cells. 
We used a 96-well plate adaptation of KabiVitrum's 

20 coatest® Factor VIII :C. One petri dish was used to 
prepare RNA for SI analysis and the other petri dish 
was used to prepare Hirt DNA used in our Southern 
analysis. After 120 hours of incubation we assayed 
the cell culture fluids for factor VIII :C activity. 

25 We expressed our results in terms of % plasma level, 
where plasma factor VIII :C concentration is approxi- 
mately 200 ng/ml. 

In repeated trans fections, both the signal- 
minus construct (negative control) and the RS deletion 

30 have shown no detectable factor VIII :C activity. 
This may be explained by the deletion of the von 
Willebrand protein binding domain in the RS deletion. 

In the 120 hour experiment analyzed below, 
cells transfected with the full-length gene produced 

35 approximately 5% of the activity observed with both 
the QD and the RE deletions. The activity observed 
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with the QD deletion was 1-46% plasma level and that 
for the BE deletion was 1.30% plasma level. 

Thus, we observed that BMT10 cells trans- 
fected with the QD and RE deletions produce at least 
5 20 times more factor VIII :C than cells trans fected 
with the full-length gene. 

H. NUCLEASE SI ANALYSIS OF 
FACTOR VIIIiC mRNA 

In order to determine the levels of mRNA 

10 in each construction, we conducted a nuclease SI 

analysis. This assay assists in the determination 
of the reason for the increased level of expression 
in our QD and RE deletions. 

We isolated RNA from 100 mm Petri dish 

15 cultures of EMTIO cells 120 hours after transfection, 
using the unpublished method of W. Schleuning and 
j # Bertonis. Briefly, according to this method, we 
lysed BMT10 cells with 3 ml of 50 mM Tris-HCl (pH 
7.5) - 5 mM EDTA - 1% SDS containing 100 pg/ml 

20 proteinase K for 20 minutes at 37°C, We transferred 
the lysate to a 50 ml conical tuhe containing- -3 ml 
of phenol and then mechanically sheared the DNA for 
15 seconds at high speed in a Polytron (Brinkmann 
Instruments). We extracted the aqueous phase with 

25 ether and adjusted it to 0,25 NaCl. We precipitated 
the nucleic acid fraction at 4°C, by the addition of 
an equal volume of isopropanol, collected it by 
centrifugation and redissolved it in 3 ml of water. 
We selectively precipitated RNA overnight at 4°C, by 

30 adjusting the solution to 2.8 M LiCl. 

We determined the amount of modified factor 
VIII :C mRNA for each construction. We isolated probes 
for the SI analysis by digesting the QD expression 
plasmid with Esp l. We labelled the 5* ends of the 

35 Es£l fragments with [y« 32 P]ATP and T4 polynucleotide 
kinase, and annealed 10 RNA to 5000 cpm of the 
32 P-antisense strand of the 477 nucleotide Esp l frag- 
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ment isolated on a 5% strand separation gel [A. M. 
Max am and W. Gilbert, Methods In Enzymology, 65, 
pp. 499-560 (1980)]. We incubated the RNA overnight 
at 48 °C in 10 pi 80% deionized formamide - 400 mM 
5 NaCl - 40 mM PIPES (pH 6.4) - 1 mM EDTA. The hybrid 
molecules were then digested for 60 minutes at 37°C 
by adding 190 ul nuclease Si at a concentration of 
100 units/ml in 0.28 M NaCl - 50 mM NaOAc (pH 4.6) - 
4.5 mM ZnS04. We terminated the digestion by adding 

10 EDTA to 10 mM and extracting with phenol. We 

denatured the protected fragments and subjected them 
to electrophoresis on a 5% strand separation gel. 
We exposed the dried gel to Kodak XAR-5 X-ray film 
backed by a Lightning-Plus intensifying screen 

15 (Dupont) overnight at -70° C. The 477 nucleotide 

Esp l fragment has one end within the hybrid intron 
spliced out from the 5' untranslated region of the 
factor VIII :C mRNA [R. J. Kaufman and P. A. Sharp, 
J. Mol. Biol. , 159, pp. 601-21 (1981)] and the other 

20 end wi thin the codon for Ala 62 (Figure 4). 

We detected modified factor VIII :C mRNA by 
protecting a single- stranded 300 nucleotide DNA frag- 
ment from digestion. The experiment was repeated 
with 1 pg RNA in order to verify that the single- 

25 stranded probe was in excess. 

The results of nuclease Si analysis of 
modified factor VIII :C mRNA for each construct are 
shown in Figure 5. Our results indicated that modi- 
fied factor VIII :C mRNA levels are the same for all 

30 three deletions and the full-length factor VIII :C 

gene. Figure 5A is the analysis for 10 ug of input 
RNA, and Figure 5B is the analysis for 1 ug of input 
RNA. Lane 1 in both figures contains as marker 500 
cpm of the labeled 477 nucleotide single-stranded 

35 DNA fragment used to protect modified factor VIII :C 
mRNA from Si digestion; that is, 10% of the input to 
each hybridization reaction. Lane 2 contained RNA 
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isolated from BMT10 cells transfected with the signal- 
minus construct; lane 3, BMT10 cells transfected 
with the full-length factor VIII :C cDNA (construct 
8.1); lane 4: BMT10 cells transfected with modified 
5 factor VIII :C cDNA (QD deletion); lane 5: BMT10 cells 
transfected with modified factor VIII :C cDNA (RE 
deletion); lane 6: BMT10 cells transfected with the 
cDNA from the RS deletion; lane 7z marker fragments 
obtained by digesting pBR322 with Hinfl and labeling 

10 their 3 f ends with [a- dATF and Klenow enzyme (a 

gift of Richard Tizard). . Equal amounts of a protected 
fragment of the expected length of 300 bases are 
evident in both figures for the 8.1, QD, RE, and RS . 
constructs • A protected fragment of approximately 

15 220 bases in length for the signal -minus construct 
is evident in both figures, reflecting the absence 
of a portion of the DNA sequence encoding the signal 
peptide . 

A comparison of Figures 5A and SB demon- 
20 strates that the input 477 probe is in molar excess 
during the hybridizations for each construct. 
Although the modified factor VIII :C activity levels 
are at least 20-fold higher for the QD and RE dele- 
tions compared to the RS and the full-length con- 
25 structs, the amount of mRNA in all . foiir " constructs 
is very nearly the same. Therefore, the reason for 
the increase in expression for the QD and. RE deletions 
is post- transcriptional in nature. 

I. SOUTHERN ANALYSIS OF PLASMID 
30 DNA ISOLATED FROM TRANSFECTED 
. BMT10 CELLS 

We conducted this analysis to determine 
the DNA levels of newly-replicated modified factor 
VIII:C plasmids for our deletions, in comparison 
35 with the full-length gene. Again, this assay assisted 
in our determination of the reason for the high yields 
of modified factor VIII :C in our QD and RE deletions. 
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In order to control for differenc s in DNA 
replication in BMT10 cells for the various constructs, 
we performed a Southern analysis of extrachromosomal 
DNA isolated from each transfection. We isolated DNA 
5 from 100 mm petri dish cultures of BMT10 cells 120 
hours after transfection according to the method of 
Hirt [B. Hirt, J. Mol. Biol. , 26, pp. 365-69 (1967)]. 
For each construction, we digested 0.5 A260 units 
with Dpn l to distinguish newly-replicated (DesI- 

10 resistant) DNA from input methylated bacterial DNA 
( Dpn I-sensitive) . We electrophoresed the DNA frag- 
ments on a 0.7% agarose gel, and blotted them to 
GeneScreen Plus to analyze the DNA. The filter was 
hybridized at 65°C in 1 M NaCl - 50 mM Tris-HCl (pH 

15 7.5) - 0.1% sodium pyrophosphate - 0.2% polyvinyl- 
pyrrolidone - 0.2% Ficoll - 0.2% BSA - 1% SDS using 
10 5 cpm/ml denatured probe. We then washed the 
filter at 65 °C with the same buffer and exposed it 
overnight at -70°C to Kodak XAR-5 X-ray film backed 

20 by a Lightning-Plus intensifying screen (Dupont). 

The factor VIII :C probe was the 2924 bp Es£l fragment 
isolated from the RE expression plasmid (see Figure 4) 
and 32 P-labeled to a specific activity of 10 cpm/pg 
by the random hexadeoxynucleotide primer method of 

25 Feinberg and Vogelstein [A. P. Feinberg and 

B. Vogelstein, Anal. Biochem. , 132, pp. 6-13 (1983)]. 

Our results, which are depicted in Figure 6, 
indicate that newly-replicated modified factor VIII :C 
plasmid DNA levels are the same for all three dele- 

30 tions and the full-length gene. Lane 1 contained the 
1 kb ladder obtained from BRL and labeled with T4 
DNA polymerase according to the manufacturer's proto- 
col; lane 2: 1 ng supercoiled RE DNA; lane 3: 10 ng 
supercoiled RE DNA; lane 4: 10 ng RE DNA digested 

35 with Dpnl; lane 5: Dpn l digest of 0.5 A260 units 

Hirt fraction obtained from BMT10 cells transfected 
with the signal-minus construct; lane 6: transfected 
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with the full-length factor VIII :C cDNA (construct 
8-1); lane 7: trans fected with the QD deletion; lane 
8: trans fected with the RE deletion: lane 9: trans- 
fected with the RS deletion. Figure 6 shows nearly 
5 * equal amounts of the supercoiled form of each con- 
struct after digestion with Dpn l ( lanes * 5-9 ) , thus 
excluding the possibility that differences in DNA 

replication enhance the expression of the QD and RE 

8 

deletions* Lane 2 contains 10 molecules of the RE 

9 

10 construct and lane 3 contains 10 molecules , suggest- 
ing that the copy number is approximately 10 3 in the 
approximately 10 5 cells successfully transfected. 

J, CONSTRUCTION OF ARG 740 -ASP 1658 
(ABBREVIATED RD) DELETION 

15 In this section, we demonstrate how we 

created the RD deletion which removes the DNA. 
sequence coding on expression from Ser 741 to Ser 
1657* We constructed this RD deletion fusion in 
three steps- In the first step, we digested plasmid 

20 QD (Figure 2) with Sau3 A between the codons for Ser 
1657 and Asp 1658 and between Glu 1794 and Asp 1795. 
This produced a 411 bp fragment* We also linearized 
plasmid tsa pML [L. Dailey et al., J* Virol, 54 , 
pp- 739-49 (1985)] at the unique Bell site. We then 

25 ligated the 411 base pair fragment derived from 

plasmid QD with T4 DNA ligase (the Ugase for this 
and the following examples) to the linearized tsa 
pML at the unique Bel l site to generate plasmid 
411. Bel l , which contains the Bel l site on the Asp 

30 1658 side of the 411 bp insert (i.e., 5 f to the 

sequence encoding Asp 1658). Plasmid 411. Bell may 
be linearized uniquely with Bel l , resulting in a 5 f 
GATC overhang which consists of the GAT codon for 
Asp 1658 and the first base of the CAA codon for Gin 

35 1659. 
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We also digested plasmid QD with Hindi 1 1 
to cleave the plasmid between Arg 740 and Ser 741 
and within the codon for Glu 321 to generate a 1258 
bp fragment. We then removed the 5« AGCT overhang 
5 with mung bean nuclease and ligated it to the Bcll- 
linearized 411. Bell fragment which had previously 
been rendered flush by treatment with Klenow enzyme 
and all four deoxynucleoside triphosphates. This 
resulted in plasmid HD.411, which contains an Asp_718 
10 site 5 1 to the fusion site within the 1258 bp Hindi 1 1 
fragment. RD.411 contains a Pstl site 3' to the 
fusion site within the 411 bp Sau3A fragment. 

Subsequently, we digested plasmid RE 
(Figure 3B) with Asp718 to cleave within the codon 

15 for Trp 585. 

We then dephosphorylated the 5« GTAC over- 
hang with calf intestinal phosphatase and then par- 
tially digested with Pstl. This partial digestion 
cleaved the linearized RE plasmid between the codons 

20 for Ala 1702 and Val 1703, thus removing a 628 bp 
fragment spanning the RE fusion. 

We then cleaved plasmid RD.411 with As£718 
and Pstl to generate a. 601 bp fragment spanning the 
RD fusion. We then ligated this fragment to the 

25 As£718 -cleaved, Pstl -partially cleaved RE plasmid 
DNA to generate plasmid RD. As demonstrated below, 
plasmid RD directed the expression of a factor VIII 
polypeptide with a fusion between Arg 740 and Asp 
1658. Cleavage of the RD polypeptide after Arg 740 

30 generates a twochain factor VIII molecule with a 

mature heavy chain calciumbridged to a 59 light chain, 
i.e. a light chain lacking the first 9 amino-terminal 
amino acids . 

K. CONSTRUCTION OF ARG 740-SER 1657 
35 (ABBREVIATED RSD DELETION) . 

In this section, we demonstrate how we 
created the RSD deletion which removes the DNA 
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seguence coding on expression for Ser 741 to Gin 
1656 of the mature polypeptide- Initially , we con- 
structed plasmid 411. Bel I and linearized it with 
Bel l as described in example "J*. Subsequently, we 
5 digested plasmid QD with Hind i II y cleaving the plasmid 
between the codons for Arg 740 and Ser .741 and within 
the codon for Glu 321 to generate a 1258 bp fragment. 
We preserved the AGC codon within the 5 f AGCT over- 
hang with Klenow enzyme and dATP, dGTP and dCTP and 

10 then removed the leftover 5* T overhang .with mung 

bean nuclease. We then ligated this modified Hind i I I 
fragment to Bell-linearized 411. Bell , which had been 
previously treated with Klenow enzyme and all four 
deoxynucleoside triphosphates, to produce plasmid 

15 RSD. 411, which contains an Asp 718 site 5 1 to the 

fusion site within the 1258 bp Hindlll fragment and 
a PstI site 3 1 to the fusion site within the 411 bp - 
Sau3 A fragment. 

We then prepared Asp7 18-cieaved, Pstl- 

20 partially cleaved RE plasmid DNA as described in 
example "D". Subsequently, we cleaved plasmid 
RSD. 411 with Asp7 18 and Pst I and ligated the result- 
ing 604 bp fragment spanning the RSD fusion to the 
Asp 718-cleaved, Pst I -partially cleaved RE plasmid 

25 DNA to generate plasmid RSD. Upon^ejspression, the 

RSD plasmid encoded a factor VIII polypeptide with a 
fusion between Arg 740 and Ser 1657. A cleavage of 
RSD polypeptide after Arg 740 generates a 2-chain 
factor VIII molecule with a mature heavy chain and a ■ 

30 delta 8 light chain, i.e. a light chain lacking the 
first eight amino terminal amino acids. Furthermore, 
because in the primary translation product Ser is 
also at position 741, RSD may also be viewed as a 
fusion between Ser 741 and Asp 1658. A cleavage 

35 after Ser 741 may generate a 2-chain factor VIII 

molecule with a heavy chain terminating at Ser 741 
and a 69 light chain. 
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L. TRANSFECTION OF AFRICAN 

GREEN MONKEY KIDNEY CELLS 

We first produced African green monkey 
kidney cell line 6L by cotransfecting cell line BSC40 
5 (BSC1 African green monkey kidney cells which have 
been adapted to grow at 40°C), [W. Brackman and 
D. Nathan, Proc. Natl. A«»d. Sci. USA, 71, pp. 942-46 
(1974)] with pLTRtsA58 and with pY3, which has a 
transcription unit for hygromycin B phosphotranf erase 

10 [K. Blochlinger, and A. Diggelmann, Mol. Cell Biol. , 
4, p. 2929-31 (1984)]. Plasmid LTRtsA58 contains 
a' transcription unit for a temperature sensitive 
SV40T- antigen allele. A mutant tsA58 virus is a 
temperature-sensitive mutant of SV40 which does not 

15 produce progeny at 39°C. The large T-antigen protein 
specified by the tsA58 mutant is much more labile at 
the nonpermissive temperature than wild type large 
T-antigen protein [H. Tegtmeyer et al., J. Virol 
16, pp. 168-78 (1975). The resulting cell line 6L 

20 inducibly expresses SV40 T-antigen at 33 °C. 

We then transfected 6L cells with super- 
coiled expression plasmids RD or RSD. The transfec- 
tion was carried out using the DEAE-dextran technique 
and chloroguine as described in Example »F". We 

25 tben incubated the transfected cells at 33»C. During 
incubation, the transfected cells synthesized and 
secreted modified factor VIII :C into the culture 
fluid. The transfectants will secrete modified factor 
VIII :C for up to 120 hours. For most assays, the 

30 cm 2 /ml ratio was approximately 5.5; that is, a con- 
fluent monolayer of 6L transfectants in a 100mm Petri 
dish (55cm 2 ) was covered with 10ml culture fluid. 

M. FACTOR VIII :C ACTIV ITY ASSAY 

We assayed the RE (Example D), RD and RSD 
35 expression constructs for factor VIII :C production 
after transfection and incubation at 33 °C for three 
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days using KabiVitrum f s Coatest® factor VIII assay 
kit adapted to a 96 well plate* Cells transfected 
with plasmid RE produced culture fluid having a factor 
VIII concentration which was 0,48% plasma level 
5 [normal plasma factor VIII concentration is approxi- 
mately 150 ng/ml]- Cells transfected with plasmid 
ED produced culture fluid having a factor VIII con- 
centration which was 0.41% plasma level* Cells trans- 
fected with plasmid RSD produced culture fluid having 
10 a factor VIII concentration which was 0.71% plasma 
level • 

In" a similar assay, cells transfected with 
plasmids RE or RSD which had been incubated at 33 °C 
for three days and then for an additional two days, 
15 yielded the following factor VIII concentrations in 
the cell culture fluid r 

Factor VIII :C Concentration In Culture 
Fluid As % Of Plasma Level 

3 Days ^ 5 Days 

20 RE Transfected Cells 0.30% 0.77% 

RSD Transfected Cells 1.50% 1.16% 

Microorganisms, recombinant DNA molecules 
and the modified factor VIII :C DNA coding sequences 
of this invention are exemplified by a culture depo- 
25 sited in the culture collection of the American Type 
Culture Collection,, in Rockville, Maryland, on 
July 22, 1986, and identified there as: 



30 



E.coli HB101 (RE) 

This culture was assigned ATCC accession number 53517. 
Two additional cultures were deposited in the American 
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Type culture Collection., in Roclcville, Maryland on 
July 27, 1987, and identified there as: 

Ad.RD.2 f E.coli HB101 (RD)], having ATCC 
accession number 67475; and Ad.RSD.1.2 fE.coli HB101 
5 ( RSD ) ] , having ATCC accession number 67476. 

While we have hereinbefore presented a 
number of embodiments of this invention, it is 
apparent that our basic construction can be altered 
to provide other embodiments which utilize the pro- 
10 cesses and compositions of this invention. Therefore, 
it will be appreciated that the scope of this inven- 
tion is to be defined by the claims appended hereto 
rather than by the specific embodiments which have 
been presented hereinbefore by way of example. 
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We claim: 

1. A recombinant DNA molecule character- 
ized by a DNA sequence coding on expression for a 
modified factor VIII:C-like polypeptide, said DNA 

5 sequence containing a deletion of a major part of 
tlie DNA sequence which codes on expression for the 
maturation polypeptide of factor VIII :C. 

2. The recombinant DNA molecule according 
to claim 1, wherein the deletion is all of the DNA 

10 sequence which codes on expression for the maturation 
polypeptide of factor VIII :C. 

3* The recombinant DNA molecule according 
to claim. 1, wherein the DNA sequence coding on 
expression for the modified factor VIII:C-like poly- 

15 peptide is selected from the group consisting of: 

ATG GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG 
TCA TGG GAC TAT ATG CAA AGT GAT CTC GGT GAG CTG CCT 
GTG GAC GCA AGA TTT CCT CCT AGA GTG CCA AAA TCT TTT 
CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG ACT CTG TTT 

20 GTA GAA TTC ACG GAT CAC CTT TTC AAC ATC GCT AAG CCA 
AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG 
GCT GAG GTT TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC 
ATG GCT TCC CAT CCT GTC AGT CTT CAT GCT GTT GGT GTA 
TCC TAC TGG AAA GCT TCT GAG GGA GCT GAA TAT GAT GAT 

25 CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC TTC : 
CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA 
GAG AAT GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC 
TAC TCA TAT CTT TCT CAT GTG GAC CTG GTA AAA GAC TTG 
AAT TCA GGC CTC ATT GGA GCC CTA CTA GTA TGT AGA GAA 

30 GGG AGT CTG GCC AAG GAA AAG ACA CAC ACC TTG CAC AAA 
TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT 
TGG CAC TCA GAA ACA AAG ^ AAC TCC TTG ATG CAG GAT AGG 
GAT GCT GCA TCT GCT CGG GCC TGG CCT AAA ATG CAC ACA 
GTC AAT GGT TAT 7TA AAC AGG TCT CTG (CTA) CCA GGT CTG 

35 ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT GTG ATT 
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GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC 
GAA GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG 
TCC TTG GAA ATC TCG CCA ATA ACT TTC CTT ACT GCT CAA 
ACA CTC TTG ATG GAC CTT GGA CAG TTT CTA CTG TTT TGT 
CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG GAA GCT TAT 
GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 
ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT 
CTT ACT GAT TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT 
GAC AAC TCT CCT TCC TTT ATC CAA ATT CGC TCA GTT GCC 
AAG AAG CAT CCT AAA ACT TGG GTA CAT TAC ATT GCT GCT 
GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC GCC 
CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT 
GGC CCT CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA 
TTT ATG GCA TAC ACA GAT GAA ACC TTT AAG ACT CGT GAA 
GCT ATT CAG CAT GAA TCA GGA ATC TTG GGA CCT TTA CTT 
TAT GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA TTT AAG 
AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA 
ATC ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA 
AAA GGT GTA AAA CAT TTG AAG GAT TTT CCA ATT CTG CCA 
GGA GAA ATA TTC AAA TAT AAA TGG ACA GTG ACT GTA GAA 
GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC CTG' ACC CGC 
TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT 
TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA 
TCT GTA GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG 
AGG AAT GTC ATC CTG TTT TCT GTA TTT GAT GAG AAC CGA 
AGC TGG TAC CTC ACA GAG AAT ATA CAA CGC TTT CTC CCC 
AAT CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC CAA 
GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT 
GAT AGT TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA 
TAC TGG TAC ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC 
CTT TCT GTC TTC TTC TCT GGA TAT ACC TTC AAA CAC AAA 
ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA TTC TCA 
GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA 
TGG ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA 
GGC ATG ACC GCC TTA CTG AAG GTT TCT AGT TGT GAC AAG 
AAC ACT GGT GAT TAT TAC GAG GAC AGT TAT GAA GAT ATT 
TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC ATT GAA CCA 
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AGA AGC TTC TCC CAG GAT CCT 
TAT GGT ACT CAG ATA CCA AAA 
GAG AAG TCA CCA GAA AAA ACA 
ACC ATT TTG TCC CTG AAC GCT 
5 ATA GCA GCA ATA AAT GAG GGA 
GAA GTC ACC TGG GCA AAG CAA 
TGC TCT CAA AAC CCA CCA GTC 
GAA ATA ACT CGT ACT. ACT CTT 
ATT GAC TAT GAT GAT ACC ATA 

10 GAA GAT TTT GAC ATT TAT GAT 
CCC CGC AGC TTT CAA AAG AAA 
GCT GCA GTG GAG AGG CTC TGG 
TCC CCA CAT GTT CTA AGA AAC 
GTC CCT CAG TTC AAG AAA GTT 

15 GAT GGC TCC TTT ACT CAG CCC 
AAT GAA CAT TTG GGA CTC CTG 
GAA GTT GAA GAT AAT ATC ATG 
GCC TCT CGT CCC TAT TCC TTC 
TAT GAG GAA GAT CAG AGG CAA 

20 AAC TTT GTC AAG CCT AAT GAA 
AAA GTG CAA CAT CAT ATG GCA 
GAC TGC AAA GCC TGG GCT TAT 
GAA AAA GAT GTG CAC TCA GGC 
GTC TGC CAC ACT AAC ACA CTG 

25 CAA GTG ACA GTA CAG GAA TTT 
ATC TTT GAT GAG ACC AAA AGC 
ATG GAA AGA AAC TGC AGG GCT 
GAA GAT CCC ACT TTT AAA GAG 
ATC AAT GGC TAC ATA ATG GAT 

30 ATG GCT CAG GAT CAA AGG ATT 
ATG GGC AGC AAT GAA AAC ATC 
GGA CAT GTG TTC ACT GTA CGA 
ATG GCA CTG TAC AAT CTC TAT 
GTG GAA ATG TTA CCA TCC AAA 

35 GAA TGC CTT ATT GGC GAG CAT 
ACA CTT TTT CTG GTG TAC AGC 
CTG GGA ATG GCT TCT GGA CAC 
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ACA GCT TCA GGA CAA TAT GGA 
GCC AGA CTT CAT TAT TCC GGA 
ACC AAG GAG CCC TTT TCT TGG 
GCA CCA ATG ATT ATT CAC GGC 
CGT CAG AAG TTC TCC AGC CTC 
ATC ATG TAT AGT CTT GAT GGG 
CGA GGA AAT TCC ACT GGA ACC 
AAT GTG GAT TCA TCT GGG ATA 
CCT CCA ATT ATT GCT CGA TAC 
CAT TAT AGC ATT CGC AGC ACT 
GGC TGT GAT TTA AAT AGT TGC 
GAG AGT AAA GCA ATA TCA GAT 
TCC TAC TTT ACC AAT ATG TTT 
AAA GCT CGA CTT CAC CTC CAA 
AGA CCT CAG GTG AAT AAT CCA 
GAC TTC CAG AAG ACA ATG AAA 
CAG GGA GTA AAA TCT CTG CTT 
GAG TTC CTC ATC TCC AGC AGT 
ACT CTC TTT TTT CAG AAT GGC 
GGA AAT CAA GAC TCC TTC ACA 
GAC CCA CCG TTA CTG ACT CGC 
CAG AGT TGG GTG CAC CAG ATT 
CTG GGC TGC GAG GCA CAG GAC 
GCC ACC AGA AGA TAC TAC CTG 
TGG GAC TAT ATG CAA AGT GAT 
GAC GCA AGA TTT CCT CCT AGA 
TTC AAC ACC TCA GTC GTG TAC 
GAA TTC ACG GAT CAC CTT TTC 
CCA CCC TGG ATG GGT CTG CTA 
GAG GTT TAT GAT ACA GTG GTC 
GCT TCC CAT CCT GTC AGT CTT 
TAC TGG AAA GCT TCT GAG GGA 
ACC AGT CAA AGG GAG AAA GAA 
GGT GGA AGC CAT ACA TAT GTC 
AAT GGT CCA ATG GCC TCT GAC 
TCA TAT CTT . TCT CAT GTG GAC 
TCA GGC CTC ATT GGA GCC CTA 



CAG TGG GCC CCA AAG CTG 
TCA ATC AAT GCC TGG AGC 
ATC AAG GTG GAT CTG TTG 
ATC AAG ACC CAG GGT GCC 
TAC ATC TCT CAG TTT ATC 
AAG AAG TGG CAG ACT TAT 
TTA ATG GTC TTC TTT GGC 
AAA CAC AAT ATT TTT AAC 
ATC CGT TTG CAC CCA ACT 
CTT CGC ATG GAG TTG ATG 
AGC ATG CCA TTG GGA ATG 
GCA CAG ATT ACT GCT TCA 
GCC ACC TGG TCT CCT TCA 
GGG AGG AGT AAT GCC TGG 
AAA GAG TGG CTG CAA GTG 
GTC ACA GGA GTA ACT ACT 
ACC AGC ATG TAT GTG AAG 
CAA GAT GGC CAT CAG TGG . 
AAA GTA AAG GTT TTT CAG 
CCT GTG GTG AAC TCT CTA 
TAC CTT CGA ATT CAC CCC 
GCC CTG AGG ATG GAG GTT 
CTC TAC; 

GGT GCA GTG GAA CTG TCA 
CTC GGT GAG CTG CCT GTG 
GTG CCA AAA TCT TTT CCA 
AAA AAG ACT CTG TTT GTA 
AAC ATC GCT AAG CCA AGG 
GGT CCT ACC ATC CAG GCT 
ATT ACA CTT AAG AAC ATG 
CAT GCT GTT GGT GTA TCC 
GCT GAA TAT GAT GAT CAG 
GAT GAT AAA GTC TTC CCT 
TGG CAG GTC CTG AAA GAG 
CCA CTG TGC CTT ACC TAC 
CTG GTA AAA GAC TTG AAT 
CTA GTA TGT AGA GAA GGG 
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TCC TAC TGG AAA GCT TCT GAG GGA GCT GAA TAT GAT GAT 
CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC TTC 
CCT. GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA 
GAG AAT GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC 
TAC TCA TAT CTT TCT CAT GTG GAC CTG GTA AAA GAC TTG 
AAT TCA GGC CTC ATT GGA GCC CTA CTA GTA TGT AGA GAA 
GGG AGT CTG GCC AAG GAA AAG ACA CAC ACC TTG CAC AAA 
TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT 
TGG CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG 
GAT GCT GCA TCT -GCT CGG GCC TGG CCT AAA ATG CAC ACA 
GTC AAT GGT TAT GTA AAC AGG TCT CTG (CTA) CCA GGT CTG 
ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT GTG ATT 
GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC 
GAA GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG 
TCC TTG GAA ATC TCG CCA ATA ACT TTC CTT ACT GCT CAA 
ACA CTC TTG ATG GAC CTT GGA CAG TTT CTA CTG TTT TGT 
CAT ATC TCT. TCC CAC CAA CAT GAT GGC ATG GAA GCT TAT 
GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 
ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT 
CTT ACT GAT TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT 
GAC AAC TCT CCT TCC TTT ATC CAA ATT CGC TCA GTT GCC 
AAG AAG CAT CCT AAA ACT TGG GTA CAT TAC ATT GCT GCT 
GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC GCC 
CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT 
GGC CCT CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA 
TTT ATG GCA TAC ACA GAT GAA ACC TTT AAG ACT CGT GAA 
GCT ATT CAG CAT GAA TCA GGA ATC TTG GGA CCT TTA CTT 
TAT . GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA TTT AAG 
AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA 
ATC ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA 
AAA GGT GTA AAA CAT TTG AAG GAT TTT CCA ATT CTG CCA 
GGA GAA ATA TTC AAA TAT AAA TGG ACA GTG ACT GTA GAA 
GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC CTG ACC CGC 
TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT 
TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA 
TCT GTA GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG 
AGG AAT GTC ATC CTG TTT TCT GTA TTT GAT GAG AAC CGA 
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AGC TGG TAC CTC ACA GAG AAT 
AAT CCA GCT GGA GTG GAG CTT 
GCC TCC AAC ATC ATG CAC AGC 
GAT AGT TTG CAG TTG TCA GTT 
5 TAC TGG TAC ATT CTA AGC. ATT 
CTT TCT GTC TTC TTC TCT GGA 
ATG- GTC TAT GAA GAC ACA CTC 
GGA GAA ACT GTC TTC ATG TCG 
TGG ATT CTG GGG TGC CAC AAC 

10 GGC ATG ACC GCC TTA CTG AAG 
AAC ACT GGT GAT TAT TAC GAG 
TCA GGA TAC TTG CTG AGT AAA 
AGA GAA ATA ACT CGT ACT ACT 
GAA ATT GAC TAT GAT GAT ACC 

15 AAG GAA GAT TTT GAC ATT TAT 
AGC CCC CGC AGC TTT CAA AAG 
ATT GCT GGA GTG GAG AGG CTC 
AGC TCC CCA CAT GTT CTA AGA 
AGT GTC CCT CAG TTC AAG AAA 

20 ACT GAT GGC TCC TTT ACT CAG 
CTA AAT GAA CAT TTG GGA CTC 
GCA GAA GTT GAA GAT AAT ATC 
CAG GCC TCT CGT CCC TAT TCC 
TCT TAT GAG GAA GAT CAG AGG 

25 AAA AAC TTT GTC AAG CCT AAT 
TGG AAA GTG CAA CAT CAT ATG 
TTT GAC TGC AAA GCC TGG GCT 
CTG GAA AAA GAT GTG CAC TCA 
CTG GTC TGC CAC ACT AAC ACA 

30 AGA CAA GTG ACA GTA CAG GAA 
ACC ATC TTT GAT GAG ACC AAA 
AAT ATG GAA AGA AAC TGC AGG 
ATG GAA GAT CCC ACT TTT AAA 
GCA ATC AAT GGC TAC ATA ATG 

35 GTA ATG GCT CAG GAT CAA AGG 
AGC ATG GGC AGC AAT GAA AAC 
AGT GGA CAT GTG TTC ACT GTA 
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AAA ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT TTT GAG 
ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT TGG CGG 
GTG GAA TGC CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG 
AGC ACA CTT TTT CTG GTG TAC AGC AAT AAG TGT CAG ACT 
CCC CTG GGA ATG GCT TCT GGA CAC ATT AGA GAT TTT CAG 
ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA AAG 
CTG GCC AGA CTT CAT TAT TCC GGA TCA AT C. AAT" GCC1TGG 
AGC ACC AAG GAG CCC TTT TCT TGG ATC AAG GTGT GAT*.CTG 
TTG GCA CCA ATG ATT ATT CAC GGC ATC' SAftl ACC. CAG- GGT 
GCC CGT CAG AAG TTC TCC AGC CTC TAC ATC TCT .'CAG^l^T^ 
ATC ATC ATG TAT AGT CTT GAT GGG AAG AAGT TGCT CKG~ 
TAT CGA GGA AAT TCC ACT GGA ACC TTA ATGL". GTC TXC^TTT 
GGC AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT, ATT: TTT 
AAC CCT CCA ATT ATT GCT CGA TAC ATC CGT TTG CAC CCA 
ACT CAT TAT AGC ATT CGC AGC ACT CTT CGC ATG GAG TTG 
ATG GGC TGT GAT TTA AAT AGT TGC AGC ATG CCA TTG GGA 
ATG GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT ACT GCT 
TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT 
TCA AAA GCT CGA CTT CAC CTC CAA GGG AGG AGT AAT GCC 
TGG AGA CCT CAG GTG AAT AAT CCA AAA GAG TGG CTG CAA 
GTG GAC TTC CAG AAG ACA ATG AAA GTC ACA GGA GTA ACT 
ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT GTG 
AAG GAG TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG 
TGG ACT CTC TTT TTT CAG AAT GGC AAA GTA AAG GTT TTT 
CAG GGA AAT CAA GAC TCC TTC ACA CCT GTG GTG AAC TCT 
CTA GAC CCA CCG TTA CTG ACT CGC TAC CTT CGA ATT CAC 
CCC CAG AGT TGG GTG CAC CAG ATT GCC CTG AGG ATG GAG 
GTT CTG GGC TGC GAG GCA CAG GAC CTC TAC; and 
GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA 
TGG GAC TAT ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG 
GAC GCA AGA TTT CCT CCT AGA GTG CCA AAA TCT TTT CCA 
TTC AAC ACC TCA GTC GTG TAC AAA AAG ACT CTG TTT GTA 
GAA TTC ACG GAT CAC CTT TTC AAC ATC GCT AAG CCA AGG 
CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT 
GAG GTT TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG 
GCT TCC CAT CCT GTC AGT CTT CAT GCT GTT GGT GTA TCC 
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TAC TGG AAA GCT TCT GAG GGA 
ACC AGT CAA AGG GAG AAA GAA 
GGT GGA AGO CAT ACA TAT GTC 
AAT GGT CCA ATG GCC TCT GAC 
5 TCA TAT CTT TCT CAT GTG GAC 
TCA GGC CTC ATT GGA GCC CTA 
AGT CTG GCC AAG GAA AAG ACA 
ATA CTA CTT TTT GCT GTA "TTT 
CAC TCA GAA ACA AAG AAC TCC 

1G GCT GCA TCT GCT CGG GCC TGG 
AAT GGT TAT GTA AAC AGG TCT 
GGA TGC CAC AGG AAA TCA GTC 
ATG GGC ACC ACT CCT GAA GTG 
GGT CAC ACA TTT CTT GTG AGG 

15 TTG GAA ATC TCG CCA ATA ACT 
CTC TTG ATG GAC CTT GGA CAG 
ATC TCT TCC CAC CAA CAT GAT 
AAA GTA GAC AGC TGT CCA GAG 
AAA AAT AAT GAA GAA GCG GAA 

20 ACT GAT TCT GAA ATG GAT GTG 
AAC TCT CCT TCC TTT ATC. CAA* 
AAG CAT CCT AAA ACT* TGG GTA 
GAG GAG GAC TGG GAC TAT GCT 
GAT GAC AGA AGT TAT AAA AGT 

25 CCT CAG CGG ATT GGT AGG AAG 
ATG GCA TAC ACA GAT GAA ACC 
ATT CAG CAT GAA TCA GGA ATC 
GGG GAA GTT GGA GAC ACA CTG 
CAA GCA AGC AGA CCA TAT AAC 

30 ACT GAT GTC CGT CCT TTG TAT 
GGT GTA AAA CAT TTG AAG GAT 
GAA ATA TTC AAA TAT AAA TGG 
GGG CCA ACT AAA TCA GAT CCT 
TAC TCT AGT TTC GTT AAT ATG 

35 GGA CTC ATT GGC CCT CTC CTC 
GTA GAT CAA AGA GGA AAC CAG 
AAT GTC ATC CTG TTT TCT GTA 
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TGG TAC CTC ACA GAG AAT ATA CAA CGC TTT CTC CCC AAT 
CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC CAA GCC 
TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT GAT 
AGT TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA TAC 
TGG TAC ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC CTT 
TCT GTC TTC TTC TCT GGA TAT ACC TTC AAA CAC AAA ATG 
GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA TTC TCA GGA 
GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 
ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC 
ATG ACC GCC TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC 
ACT GGT GAT TAT TAC GAG GAC AGT TAT GAA GAT ATT TCA 
GCA TAC TTG CTG AGT AAA AAC AAT GCC ATT GAA CCA AGA 
GAA ATA ACT CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA 
ATT GAC TAT GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG 
GAA GAT TTT GAC ATT TAT GAT GAG GAT GAA AAT CAG AGC 
CCC CGC AGC TTT CAA AAG AAA ACA CGA CAC TAT TTT ATT 
GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG ATG AGT AGC 
TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT 
GTC CCT CAG TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT 
GAT GGC TCC TTT ACT CAG CCC TTA TAC CGT GGA GAA CTA 
AAT GAA CAT TTG GGA CTC CTG GGG CCA TAT ATA AGA GCA 
GAA GTT GAA GAT AAT ATC ATG GTA ACT TTC AGA AAT CAG 
GCC TCT CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT TCT 
TAT GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA AAA 
AAC TTT GTC AAG CCT AAT GAA ACC AAA ACT TAC TTT TGG 
AAA GTG CAA CAT CAT ATG GCA CCC ACT AAA GAT GAG TTT 
GAC TGC AAA GCC TGG GCT TAT TTC TCT GAT GTT GAC CTG 
GAA AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG 
GTC TGC CAC ACT AAC ACA CTG AAC CCT GCT CAT GGG AGA 
CAA GTG ACA GTA CAG GAA TTT GCT CTG TTT TTC (CTC) ACC 
ATC TTT GAT GAG ACC AAA AGC TGG TAC TTC ACT GAA AAT 
ATG GAA AGA AAC TGC AGG GCT CCC TGC AAT ATC CAG ATG 
GAA GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC CAT GCA 
ATC AAT GGC TAC ATA ATG GAT ACA . CTA CCT GGC TTA GTA 
ATG GCT CAG GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC 
ATG GGC AGC AAT GAA AAC ATC CAT TCT ATT CAT TTC AGT 
GGA CAT GTG TTC ACT GTA CGA AAA AAA GAG GAG TAT AAA 
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4. The recombinant DNA molecule according 
30 to any one of claims 1-3, wherein the DNA sequence 

coding on expression for the modified factor VIII :C- 
like polypeptide is operatively linked to an expres- 
sion control sequence. 
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. 5, The recombinant DNA molecule according 
to claim 4, wherein the expression control sequence 
is selected from the group consisting of the lac 
system , the trp system , the tac system, the trc 
system, major operator and promoter regions of 
phage A, the control region of fd coat protein, the 
early and late promoters of SV40, promoters derived 
from polyoma, adenovirus and simian virus, the 
promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of yeast acid phos- 
phatase, the promoters of the yeast a -mating factors, 
and other sequences known to control the expression 
of genes of prokaryotic or eukaryotic cells and their 
viruses, or combinations thereof. 

6. A modified factor VIII:C-like polypep- 
tide having a formula selected from the group consist- 
ing of: 

met ala thr arg arg tyr tyr leu gly ala val glu leu 
ser trp asp tyr met gin ser asp leu gly glu leu pro 
val asp ala arg phe pro pro arg val pro lys ser phe 
pro phe asn thr ser val val tyr lys lys thr leu phe 
val glu phe thr asp his leu phe asn ile ala lys pro 
arg pro pro trp met gly leu leu gly pro thr ile gin 
ala glu val tyr asp thr val val ile thr leu lys asn 
met ala ser his pro val ser leu his ala val gly val 
ser tyr trp lys ala ser glu gly ala glu tyr asp asp 
gin thr ser gin arg glu lys glu asp asp lys val phe 
pro gly gly ser his thr tyr val trp gin val leu lys 
glu asn gly pro met ala ser asp pro leu cys leu thr 
tyr ser tyr leu ser his val asp leu val lys asp leu 
asn ser gly leu ile gly ala leu leu val cys arg glu 
gly ser leu ala lys glu lys thr gin thr leu his lys 
phe ile leu leu phe ala val phe asp glu gly lys ser 
trp his ser glu thr lys asn ser leu met gin asp arg 
asp ala ala ser ala arg ala trp pro lys met his thr 
val asn gly try val asn arg ser leu pro gly leu ile 
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gly cys his arg lys ser val 
met gly thr thr pro glu val 
gly his thr phe leu val arg 
leu glu ile ser pro ile thr 
leu leu met asp leu gly gin 
ile ser ser his gin his asp 
lys val asp ser cys pro glu 
lys asn asn glu glu ala glu 
thr asp ser glu met asp val 
asn ser pro ser phe ile gin 
lys his pro lys thr trp val 
glu glu asp trp asp tyr ala 
asp asp arg ser tyr lys ser 
pro gin arg ile gly arg lys 
met ala tyr thr asp glu thr 
ile gin his glu ser gly ile 
gly glu val gly asp thr leu 
gin ala ser arg pro tyr asn 
thr asp val arg pro leu tyr 
gly val lys his leu lys asp 
glu ile phe lys tyr lys trp 
gly pro thr lys ser asp pro 
tyr ser ser phe val asn met 
gly leu ile gly pro leu leu 
val asp gin arg gly asn gin 
asn val ile leu phe ser val 
trp tyr leu thr glu asn ile 
pro ala gly val gin leu glu 
ser asn ile met his ser ile 
ser leu gin leu ser val cys 
trp tyr ile leu ser ile gly 
ser val phe phe ser gly tyr 
val tyr glu asp thr leu thr 
glu thr val phe met ser met 
ile leu gly cys his asn ser 
met thr ala leu leu lys val 
thr gly asp tyr tyr glu asp 
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ala tyr leu leu ser lys asn asn ala ile glu pro arg 
ser phe ser gin asp pro leu ala trp asp asn his tyr 
gly thr gin ile pro lys glu glu trp lys ser gin glu 
lys ser pro glu lys thr ala phe lys lys lys asp thr 
ile leu ser leu asn ala cys glu ser asn his ala ile 
ala ala ile asn glu gly gin asn lys* pro glu ile glu 
val thr trp ala lys gin gly arg thr glu arg leu cys 
ser gin asn pro pro val leu lys arg his gin arg glu 
ile thr arg thr thr leu gin ser asp gin glu glu ile 
asp tyr asp asp thr ile ser val glu met lys lys glu 
asp phe asp ile tyr asp glu asp glu asn gin ser pro 
arg ser phe gin lys lys thr arg his tyr phe ile ala 
ala val glu arg leu trp asp tyr gly met ser ser ser 
pro his val leu arg asn arg ala gin ser gly ser val 
pro gin phe lys lys val val phe gin glu phe thr asp 
gly ser phe thr gin pro leu tyr arg gly glu leu asn 
glu his leu gly leu leu gly pro tyr ile arg ala glu 
val glu asp" asn ile met val thr phe arg asn gin ala 
ser arg pro tyr ser phe tyr ser ser leu ile ser tyr 
glu glu asp gin arg gin gly ala glu pro arg lys asn 
phe val lys pro asn glu thr lys thr tyr phe trp lys 
val gin his his met ala pro thr . lys asp glu phe asp 
cys lys ala trp ala tyr phe ser asp val asp leu glu 
lys asp val his ser gly leu ile gly pro leu leu val 
cys his thr asn thr leu asn pro ala his gly arg gin 
val thr val gin glu phe ala leu phe phe (leu) thr 
ile phe asp glu thr lys ser trp tyr phe thr glu asn 
met glu arg asn cys arg ala pro cys asn ile gin met 
glu asp pro thr phe lys glu asn tyr arg phe his ala 
ile asn gly tyr ile met asp thr leu pro gly leu val 
met ala gin asp gin arg ile arg trp tyr leu leu ser 
met gly ser asn glu asn ile" his ser ile his phe ser 
gly his val phe thr val arg lys lys glu glu tyr lys 
met ala leu tyr asn leu tyr pro gly val phe glu thr 
val glu met leu pro ser lys ala gly ile trp arg val 
glu cys leu ile gly glu his leu his ala gly met ser 
thr leu phe leu val tyr ser asn lys cys gin thr pro 
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leu gly met ala ser gly his ile arg asp phe gin ile 
thr ala ser gly gin tyr gly gin trp ala pro lys leu 
ala arg leu his tyr ser gly ser ile asn ala trp ser 
thr lys glu pro phe ser trp ile lys val asp leu leu 
5 ala pro met ile ile his gly ile lys thr gin gly ala 
arg gin lys phe ser ser leu tyr ile ser gin phe ile 
ile met tyr ser leu asp gly lys lys trp gin thr tyr 
arg gly asn ser thr gly thr leu met val phe phe gly 
asn val asp ser ser gly ile lys his asn ile phe asn 

10 pro pro ile ile ala arg tyr ile arg leu his pro thr 
his tyr ser ile arg ser thr. leu arg met glu leu met 
gly cys asp leu asn ser cys ser met pro leu gly met 
glu ser lys ala ile ser asp ala gin ile thr ala ser 
ser tyr phe thr asn met phe ala thr trp ser pro ser 

15 lys ala arg leu his leu gin gly arg ser asn ala trp 
arg pro gin val asn asn pro lys glu trp leu gin val 
asp phe gin lys thr met lys val thr gly val thr thr 
gin gly val lys ser leu leu thr ser met tyr val lys 
glu phe leu ile ser ser ser gin asp gly his gin trp 

20 thr leu phe phe gin asn gly lys val lys val phe gin 
gly asn gin asp ser phe thr pro val val asn ser leu 
asp pro pro leu leu thr arg tyr leu arg ile his pro 
gin ser trp val his gin ile ala leu arg met glu val 
leu gly cys glu ala gin asp leu tyr; 

25 ala thr arg arg tyr tyr leu gly ala val glu leu ser 
trp asp tyr met gin ser asp leu gly glu leu pro val 
asp ala arg phe pro pro arg val pro lys ser phe pro 
phe asn thr ser val val tyr lys lys thr leu phe val 
glu phe thr asp his leu phe asn ile ala lys pro arg 

30 pro pro trp met gly leu leu gly pro thr ile gin ala 
glu val tyr asp thr val val ile thr leu lys asn met 
ala ser his pro val ser leu his ala val gly val ser 
tyr trp lys ala ser glu gly ala glu tyr asp asp gin 
thr ser gin arg glu lys glu asp asp lys val phe pro. 

35 gly gly ser his thr tyr val trp gin val leu lys glu 
asn gly pro met ala ser asp pro leu cys leu thr tyr 
ser tyr leu ser his val asp leu val lys asp leu asn 
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ser gly leu ile gly ala leu leu val cys arg glu gly 
ser leu ala lys glu lys tfar gin thr leu his lys phe 
ile leu leu phe ala val phe asp glu gly lys ser trp 
his ser glu thr lys asn ser leu met. gin asp arg asp 
ala ala ser ala arg ala trp pro lys met his thr val 
asn gly try val asn arg ser leu pro gly leu ile gly 
cys his arg lys ser val tyr trp his val ile gly met 
gly thr thr pro glu val his ser ile phe leu glu gly 
his thr phe leu val arg asn his arg gin ala ser leu 
glu ile ser pro ile thr phe leu thr ala gin thr leu 
leu met asp leu gly gin phe leu leu phe cys his ile 
ser ser his gin his asp gly met glu ala tyr val lys 
val asp ser cys pro glu glu pro gin leu arg met lys 
asn asn glu glu ala glu asp tyr asp asp asp leu thr 
15 asp ser glu met asp val val arg phe asp asp asp asn 
ser pro ser phe ile gin ile arg ser val ala lys lys 
his pro lys thr trp val his tyr ile ala ala glu glu 
glu asp trp asp tyr ala pro leu val leu ala pro asp 
asp arg ser tyr lys ser gin tyr leu asn asn gly pro 
gin arg ile gly arg lys tyr lys lys val arg phe met 
ala tyr thr asp glu thr phe lys thr arg glu ala ile 
gin his glu ser gly ile leu gly pro leu leu tyr gly 
glu val gly asp thr leu leu ile ile phe lys asn gin 
ala ser arg pro tyr asn ile tyr pro his gly ile thr 
asp val arg pro leu tyr ser arg arg leu pro lys gly 
val lys his leu lys asp phe pro ile leu pro gly glu 
ile phe lys tyr lys trp thr val thr val glu asp gly 
pro thr lys ser asp pro arg cys leu thr arg tyr tyr 
ser ser phe val asn met glu arg asp leu ala ser gly 
leu ile gly pro leu leu ile cys tyr lys glu ser val 
asp gin arg gly asn gin ile met ser asp lys arg asn 
val ile leu phe ser val phe asp glu asn arg ser trp 
tyr leu thr glu asn ile gin arg phe leu pro asn pro 
ala gly val gin leu glu asp pro glu phe gin ala ser 
asn ile met his ser ile asn gly tyr val phe asp ser 
leu gin leu ser val cys leu his glu val ala tyr trp 
tyr ile leu ser ile gly ala gin thr asp phe leu ser 
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val phe phe ser gly tyr thr phe lys his lys met val 
tyr glu asp thr leu thr leu phe pro phe ser gly glu 
thr val phe met ser met glu asn pro gly leu trp ile 
leu- gly cys his asn ser asp phe arg asn arg gly met 
thr ala leu leu lys val ser ser cys asp lys asn thr 
gly asp tyr tyr glu asp ser tyr glu asp ile ser ala 
tyr leu leu ser lys asn asn ala ile glu pro arg ser 
phe ser gin asp pro leu ala trp asp asn his tyr gly 
thr gin ile pro lys glu glu tip lys ser gin glu lys 
ser pro glu lys thr ala phe lys lys lys asp thr ile 
leu ser leu asn ala cys glu ser asn his ala ile ala 
ala ile asn glu gly gin asn lys pro glu ile glu val 
thr trp ala lys gin gly arg thr glu arg leu cys ser 
gin asn pro pro val leu lys arg his gin arg glu ile 
15 thr arg thr thr leu gin ser asp gin glu glu ile asp 
tyr asp asp thr ile ser val glu met lys lys glu asp 
phe asp ile. tyr asp glu asp glu asn gin ser pro arg 
ser phe gin lys lys thr arg his tyr phe ile ala ala* 
val glu arg leu trp asp tyr gly met ser ser ser pro 
20 his val leu arg asn arg ala gin ser gly ser val pro 
gin phe lys lys val val phe gin glu phe thr asp gly 
ser phe thr gin pro leu tyr arg gly glu leu asn glu 
his leu gly leu leu gly pro tyr ile arg ala glu val 
glu asp asn ile met val thr phe arg asn gin ala ser 
25 arg pro tyr ser phe tyr ser ser leu ile ser tyr glu 
glu asp gin arg gin gly ala glu pro arg lys asn phe 
val lys pro asn glu thr lys thr tyr phe trp lys val 
gin his his met ala pro thr lys asp glu phe asp cys 
lys ala trp ala tyr phe ser asp val asp leu glu lys 
30 asp val his ser gly leu ile gly pro leu leu val cys 
his thr asn thr leu asn pro ala his gly arg gin val 
thr val gin glu phe ala leu phe phe (leu) thr ile 
phe asp glu thr lys ser trp tyr phe thr glu asn met 
glu arg asn cys arg ala pro cys asn ile gin met glu 
35 asp pro thr phe lys glu asn tyr arg phe his ala ile 
asn gly tyr ile met asp thr leu pro gly leu val met 
ala gin asp gin arg ile arg trp tyr leu leu ser met 



WO 8S/00831 



PCT/US87/01814 



-57- 

gly ser asn glu asn ile his ser ile his phe ser gly 
his val phe thr val arg lys lys glu glu tyr lys met 
ala leu tyr asn leu tyr pro gly val phe glu thr val 
glu met leu pro ser lys ala gly ile trp arg val glu 
cys leu ile gly glu his leu his ala gly met ser thr 
leu phe leu val tyr ser asn lys cys gin thr pro leu 
gly met ala ser gly his ile arg asp phe gin ile thr 
ala ser gly gin tyr gly gin trp ala pro lys leu ala 
arg leu his tyr ser gly ser ile asn ala trp ser thr 
lys glu pro phe ser trp ile lys val asp leu leu ala 
pro met ile ile his gly ile lys thr gin gly ala arg 
gin lys phe ser ser leu tyr ile ser gin phe ile ile 
met tyr ser leu asp gly lys lys trp gin thr tyr arg 
gly asn ser thr gly thr leu met val phe phe gly asn 
val asp ser ser gly ile lys his asn ile phe asn pro 
pro ile ile ala arg tyr ile arg leu his pro thr his 
tyr ser ile arg ser thr leu arg met glu leu met gly 
cys asp leu asn ser cys ser met pro leu gly met glu 
ser lys ala ile ser asp ala gin ile thr ala ser ser 
tyr phe thr asn met phe ala thr trp ser pro ser lys 
ala arg leu his leu gin gly arg ser asn ala trp arg 
pro gin val asn asn pro lys glu trp leu gin val asp 
phe gin lys thr met lys val thr gly val thr thr gin 
gly val lys ser leu leu thr ser met tyr val lys glu 
phe leu ile ser ser ser gin asp gly his gin trp thr 
leu phe phe gin asn gly lys val lys val phe gin gly 
asn gin asp ser phe thr pro val val asn ser leu asp 
pro pro leu leu thr arg tyr leu arg ile his pro gin 
ser trp val his gin ile ala leu arg met glu val leu 
gly cys glu ala gin asp leu tyr; 

met ala thr arg arg tyr tyr leu gly ala val glu leu 
ser trp asp tyr met gin ser asp leu gly glu leu pro 
val asp ala arg phe pro pro arg val pro lys ser phe 
pro phe asn thr ser val val tyr lys lys thr leu phe 
val glu phe thr asp his leu phe asn ile ala lys pro 
arg pro pro trp met gly leu leu gly pro thr ile gin 
ala glu val tyr asp thr val val ile thr leu lys asn 
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met ala ser his pro val ser leu his ala val gly val 
ser tyr trp lys ala ser glu gly ala glu tyr asp asp 
gin thr ser gin arg glu lys glu asp asp lys val phe 
pro gly gly ser his thr tyr val trp gin val leu lys 
5 v glu asn gly pro met ala ser asp pro leu cys leu thr 
tyr ser tyr leu ser his val asp leu val lys asp leu 
asn ser gly leu ile gly ala leu leu val cys arg glu 
gly ser leu ala lys glu lys thr gin thr leu his lys 
phe ile leu leu phe ala val phe asp glu gly lys ser 

10 trp his ser glu thr lys asn ser leu met gin asp arg 
asp ala ala ser ala arg ala trp pro lys met his thr 
val asn' gly tyr val asn arg ser leu pro gly leu ile 
gly cys his arg lys ser val tyr trp his val ile gly 
met gly thr thr pro glu val his ser ile phe leu glu 

15 gly his thr phe leu val arg asn his arg gin ala ser 
leu glu ile ser pro ile thr phe leu thr ala gin thr 
leu leu met asp leu gly gin phe leu leu phe cys his 
ile ser ser his gin his asp gly met glu ala tyr val 
lys val asp ser cys pro glu glu pro gin leu arg met 

20 lys asn asn glu glu ala glu asp tyr asp asp asp leu 
thr asp ser glu met asp val val arg phe asp asp asp 
asn ser pro ser phe ile gin ile arg ser val ala lys 
lys his pro lys thr trp val his tyr ile ala ala glu 
glu glu asp trp asp tyr ala pro leu val leu ala pro 

25 asp asp arg ser tyr lys ser gin tyr leu asn asn gly 
pro gin arg ile gly arg lys try lys lys val arg phe 
met ala tyr thr asp glu thr phe lys thr arg glu ala 
ile gin his glu ser gly ile leu gly pro leu leu tyr 
gly glu val gly asp thr leu leu ile ile phe lys asn 

30 gin ala ser arg pro tyr asn ile tyr pro his gly ile 
thr asp val arg pro leu tyr ser arg arg leu pro lys 
gly val lys his leu lys asp phe pro ile leu pro gly 
glu ile phe lys tyr lys trp thr val thr val glu asp 
gly pro thr lys ser asp pro arg cys leu thr arg tyr 

35 tyr ser ser phe val asn met glu arg asp leu ala ser 
gly leu ile gly pro leu leu ile cys tyr lys glu ser 
val asp gin arg gly 'asn gin ile met ser asp lys arg 
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ser gly his val phe thr val arg lys lys glu glu tyr 
lys met ala leu tyr asn leu tyr pro gly val phe glu 
thr val glu met leu pro ser lys ala gly ile trp arg 
val glu cys leu ile gly glu his leu his ala gly met 
5 ser thr leu phe leu val tyr ser asn lys cys gin thr 
pro leu gly met ala ser gly his ile arg asp phe gin 
ile thr ala ser gly gin tyr gly gin trp ala pro lys 
leu ala arg leu his tyr ser gly ser ile asn ala trp 
ser thr lys glu pro phe ser trp ile lys val asp leu 

10 leu ala pro met ile ile his gly ile lys thr gin gly 
ala arg gin lys phe ser ser leu tyr ile ser gin phe 
ile ile met tyr ser leu asp gly lys lys trp gin thr 
tyr arg gly asn ser thr gly thr leu met val phe phe 
gly asn val asp ser ser gly ile lys his asn ile phe 

15 asn pro pro ile ile ala arg tyr ile arg leu his pro 
thr his tyr ser ile arg ser thr leu arg met glu leu 
met gly cys asp leu asn ser cys ser met pro leu gly 
met glu ser lys ala ile ser asp ala gin ile thr ala 
ser ser tyr phe thr asn met phe ala thr trp ser pro 

20 ser lys ala arg leu his leu gin gly arg ser asn ala 
trp arg pro gin val asn asn pro lys glu trp leu gin 
val asp phe gin lys thr met lys val thr gly val thr 
thr gin gly val lys ser leu leu thr ser met tyr val 
lys glu phe leu ile ser ser ser gin asp gly his gin 

25 trp thr leu phe phe gin asn gly lys val lys val phe 
gin gly asn gin asp ser phe thr pro val val asn ser 
leu asp pro pro leu leu thr arg tyr leu arg ile his 
pro gin ser trp val his gin ile ala leu arg met glu 
val leu gly cys glu ala gin asp leu tyr; and 

30 ala thr arg arg tyr tyr leu gly ala val glu leu ser 
trp asp tyr met gin ser asp leu gly glu leu pro val 
asp ala arg phe pro pro arg val pro lys ser phe pro 
phe asn thr ser val val tyr lys lys thr leu phe val 
'glu phe thr asp his leu phe asn ile ala lys pro arg 

35" pro pro trp met gly leu leu gly pro thr ile gin ala 
glu val tyr asp thr val val ile thr leu lys asn met 
ala ser his pro val ser leu his ala val gly val ser 
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sex glu gly ala glu tyx 
glu lys glu asp asp lys 
thx tyx val trp gin val 
ala sex asp pxo leu cys 
his val asp leu val lys 
gly ala leu leu val cys 
glu lys thx gin thx leu 
ala val phe asp glu gly 
lys asn sex leu met gin 
axg ala txp pxo lys met 
asn axg sex leu pxo gly 
sex val tyx txp his val 
glu val his sex ile phe 
val axg asn his axg gin 
ile thx phe leu thx ala 
gly gin phe leu leu phe 
his asp gly met glu ala 
pro glu glu pxo gin leu 
ala glu asp tyx asp asp 
asp val val axg phe asp 
ile gin ile axg sex val 
txp val his tyx ile ala 
tyx ala pro leu val leu 
lys ser gin tyx leu asn 
axg lys try lys lys val 
glu thx phe lys thx axg 
gly ile leu gly pxo leu 
thx leu leu ile ile phe 
tyr asn ile tyr pro his 
leu tyr ser arg axg leu 
lys asp phe pxo ile leu 
lys txp thx val thx val 
asp pxo axg cys leu thx 
asn met glu arg asp leu 
leu leu ile cys tyx lys 
asn gin ile met sex asp 
sex val phe asp glu asn 



PCT/US87/01814 



asp 


asp 


gin 


val phe 


pxo 


leu lys 


glu 


leu 


thr 


tyx 


asp 


leu 


asn 


axg glu 


gly 


his 


lys 


phe 


lys 


ser 


txp 

XT 


asp 


axg 


asp 


his 


thx 


val 


leu 


ile 


gly 


ile 


gly 


met 


leu glu 


gly 


ala 


sex 


leu 


gin thx 


leu 


cys 


his 


ile 


tyx. val 


lys 


axg 


met 


lys 


asp 


leu 


thx 


asp 


asp 


asn 


ala 


lys 


lys 


ala 


glu 


glu 


ala 


pxo 


asp 


asn 


gly 


pxo 


axg 


phe 


met 


glu 


ala 


ile 


leu 


tyx 


gly 


lys 


asn 


gin 


gly 


ile 


thx 


pxo 


lys 


gly 


pxo 


gly 


glu 


glu 


asp 


gly 


axg 


tyx 


tyr 


ala 


sex 


gly 


glu 


sex 


val 


lys 


axg 


asn 


axg 


sex 


trp 



WO 88/00831 PCT/US87/018W 



-62- 

tyr leu thr glu asn ile gin arg phe leu pro asn pro 
ala gly val gin leu glu asp pro glu phe gin ala ser 
asn ile met his ser ile asn gly tyr val phe asp ser 
leu gin leu ser val cys leu his glu val ala tyr trp 
5 tyr ile leu ser ile gly ala gin thr asp phe leu ser 
val phe phe ser gly tyr thr phe lys his lys met val 
tyr glu asp thr leu thr leu phe pro phe ser gly glu 
thr val phe met ser met glu asn pro gly leu trp ile 
leu gly cys his asn ser asp phe arg asn arg gly met 

10 thr ala leu leu lys val ser ser cys asp lys asn thr 
gly asp tyr tyr glu asp ser tyr glu asp ile ser ala 
tyr leu leu ser lys asn asn ala ile glu pro arg glu 
ile thr arg thr thr leu gin ser asp gin glu glu ile 
asp tyr asp aspi thr ile ser val glu met lys lys glu 

15 asp phe asp ile tyr asp glu asp glu asn gin ser pro 
arg ser phe gin lys lys thr arg his tyr phe ile ala 
ala val glu arg .leu trp asp tyr gly met ser ser ser 
pro his val leu arg asn arg ala gin ser gly ser val 
pro gin phe lys lys val val phe gin glu phe thr asp 

20 gly ser phe thr gin pro leu tyr arg gly glu leu asn 
glu his leu gly leu leu gly pro tyr ile arg ala glu 
val glu asp asn ile met val thr phe arg asn gin ala 
ser arg pro tyr ser phe tyr ser ser leu ile ser tyr 
glu glu asp gin arg gin gly ala glu pro arg lys asn 

25 phe val lys pro asn glu thr lys thr tyr phe trp lys 
val gin his his met ala pro thr lys asp glu phe asp 
cys lys ala trp ala tyr phe ser asp val asp leu glu 
lys asp val his ser gly leu ile gly pro leu leu val 
cys his thr asn thr leu asn pro ala his gly arg gin 

30 val thr val gin glu phe ala leu phe phe(leu) thr 

ile phe asp glu thr lys ser trp tyr phe thr glu asn 
met glu arg asn cys arg ala pro cys asn ile gin met 
glu asp pro thr phe lys glu asn tyr arg phe his ala 
ile asn gly tyr ile met asp thr leu pro gly leu val 

35 met ala gin asp gin arg ile arg trp tyr leu leu ser 
met gly ser asn glu asn ile his ser ile his phe ser 
gly his val phe thr val arg lys lys glu glu tyr lys 
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7. A modified factor VIII:C-like polypep- 
30 tide, comprising the N-terminal heavy chain of mature 

factor VIII :C linked directly to the C- terminal light 
chain of mature factor VIII :C, said polypeptide being 
essentially free of other serum proteins. 

8. A process for producing a polypeptide, 
35 comprising the step of proteolytically cleaving the 
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modified factor VIII:C-like polypeptide of claim 7 
into the N-terminal heavy chain of mature factor 
VIII :C and the C- terminal light chain of mature 
factor VIII :C. 

5 9* The process according to claim 8,. 

further comprising the step of linking together by 
an alkaline metal bridge , the N- terminal heavy chain 
of mature factor VIII :C and the C-terminal heavy 
chain of mature factor VIII rC. 

10 10. A process for producing a modified 

factor VIII:C-like polypeptide comprising the step 
of culturing a host transformed with a recombinant 
DNA molecule as defined in claims 1 through 5. 

11. The process according to any of 
15 claims 8, 9 or 10, wherein the host is selected 

from BMT10, BSC1, BSC40 , COS1, COS7, CHO cells and 
other animal and human cells in culture, 

12. A pharmaceutical composition comprising 
a polypeptide, produced according to the process of 

20 any of claims 8, 9 or 10, in an amount effective as 

a coagulant and a pharmaceutically acceptable carrier. 

13. A pharmaceutical composition comprising 
a modified factor VIII:C-like polypeptide as defined 
in any one of claims 6-9 in an amount effective as a 

25 coagulant and a pharmaceutically acceptable carrier. 

14. A method for treating haemophilia 
comprising the step of treating a human with the 
pharmaceutical composition as defined in claims 12 
and 13. 
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15. A recombinant DNA molecule according 
to claim 4, selected from the group consisting of 

the recombinant DNA molecules contained in transformed 
host E.coli HBIOI(RE), E.coli (HBIOI(RD) or E.coli 
EBIOI(RSD). 

16. A modified factor VIII:C-like polypep- 
tide produced by a host transformed with a recombinant 
DNA molecule selected from a group consisting of 
recombinant DNA molecules contained in transformed 
host E.coli HBIOI(RE), E.coli HBIOI(RD) and E.coli 
HBIOI(RSD). 

17. A process for producing a modified 
factor VIII;C-like polypeptide comprising the step 
of culturing a host transformed with a recombinant 
DNA molecule selected from a group consisting of 
recombinant DNA molecules contained in transformed 
host E.coli HBIOI(RE), E.coli EBIOI(RD) and E.coli 
HBIOI(RSD). 

18. A pharmaceutical composition comprising 
a polypeptide produced according to the process of 
claim 17 in an amount effective as a coagulant and a 
pharmaceutically acceptable carrier. 

19. A method for treating haemophilia 
comprising the step of treating a human with a phar- 
maceutical composition according to claim 18. 
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1 10 
ala thr arg arg tyr tyr leu gly ala val glu leu ser 
GCC ACC AGA AGA TAC TAG CTG GGT. GCA GTG GAA CTG TCA 



20 

trp asp tyr met gin ser asp leu gly glu leu pro val 
TGG GAC TAT ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG 



30 

asp ala arg phe pro pro arg val pro lys ser phe pro 
GAC GCA AGA TTT CCT CCT AGA GTG CCA AAA TCT TTT CCA 

40 50 
phe asn thr ser val val tyr lys lys thr leu phe val 
TTC AAC ACC TCA GTC GTG TAC AAA AAG ACT CTG TTT GTA 



ecoRI 

glu phe thr asp his leu 
GAA TTC ACG GAT CAC CTT 

70 

pro pro trp met gly leu 
CCA CCC TGG ATG GGT CTG 



60 

phe asn ile ala lys pro arg 
TTC AAC ATC GCT AAG CCA AGG 



leu gly pro thr ile gin ala 
CTA GGT CCT ACC ATC CAG GCT 



80 90 
glu val tyr asp thr val val ile thr leu lys asn met 
GAG GTT TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG 



100 

ala ser his pro val ser leu his ala val gly val ser 
GCT TCC CAT CCT GTC AGT CTT CAT GCT GTT GGT GTA TCC 



FIG. 7 

SUBSTITUTE SHEET 
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hindlll HO 
tyr trp lys ala ser glu gly 

TAC TGG AAA -GCT TCT GAG GGA 
120 

thr ser gin arg glu lys glu 
ACC AGT CAA AGG GAG AAA GAA 



giy giy ser his thr tyr val 

GGT GGA AGC CAT ACA TAT GTC 

150 

asn gly pro met ala ser asp 
AAT GGT CCA ATG GCC TCT GAC 

160 

ser tyr leu ser his val asp 
TCA TAT CTT TCT CAT GTG GAC 

170 

ser gly leu ile gly ala leu 
TCA GGC CTC ATT GGA GCC CTA 



ser leu ala lys glu lys thr 
AGT CTG GCC AAG GAA AAG ACA 

200 

ile leu leu phe ala val phe 
ATA CTA CTT TTT GCT GTA TTT 

210 

his ser glu thr lys asn ser 
CAC TCA GAA ACA AAG AAC TCC 



ala glu tyr asp asp gin 
GCT GAA TAT GAT GAT CAG 

130 

asp asp. lys^val.piievpro 
GAT GAT." 'AAA . GTC" TEC CCT 

•*■ 

trp gin val/ leu.. lys; : glu 
TGG CAC GTC CTG, AAArVGAG 



pro leu cys leu thr tyr 
CCA CTG TGC CTT ACC TAC 

ecoRI 

leu val lys asp leu asn 
CTG GTA AAA GAC TTG AAT 

180 

leu val cys arg glu gly 
CTA GTA TGT AGA GAA GGG 

190 

gin thr leu his lys phe 
CAC ACC TTG CAC AAA TTT 



asp glu gly lys ser trp 

GAT GAA GGG AAA AGT TGG 
220 

leu met gin asp arg asp 
TTG ATG CAG GAT AGG GAT 
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ala ala ser ala arg ala 
GCT CCA TCT GCT CGG GCC 

240 

asn gly try val asn arg 
AAT GGT TAT GTA AAC AGG 

250 

cys his arg lys ser val 
TGC CAC AGG AAA TCA GTC 



gly thr thr pro glu val 
GGC ACC ACT CCT GAA GTG 



his thr phe leu val arg 
CAC ACA TTT CTT GTG AGG 

290 

glu ile ser pro ile thr 
GAA ATC TCG' CCA ATA ACT 

300 

leu met asp leu gly gin 
TTG ATG GAC CTT GGA CAG 

ser ser his gin his asp 
TCT TCC CAC CAA CAT GAT 

330 

val asp ser cys pro glu 

GTA GAC AGC TGT CCA GAG 



(t/ZS 

230 

trp pro lys met his thr val 
TGG CCT A£A ATG CAC ACA GTC 

ser leu pro gly leu ile gly 
TCT CTG CCA GGT CTG ATT GGA 

260 

tyr trp his val ile gly met 
TAT TGG CAT GTG ATT GGA ATG 

270 

his ser ile phe leu glu gly 
CAC TCA ATA TTC CTC GAA GGT 

280 

asn his arg gin ala ser leu 
AAC CAT CGC CAG GCG TCC TTG 

phe leu thr ala gin thr leu 
TTC CTT ACT GCT CAA ACA CTC 

310 

phe leu leu phe cys his ile 
TTT CTA CTG TTT TGT CAT ATC 

320 hindlll 
gly met glu ala tyr val lys 
GGC ATG GAA GCT TAT GTC AAA 



glu pro gin leu arg met lys 
GAA CCC CAA CTA CGA ATG AAA 



F/G. 7(cont'd) 
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340 350 
asn asn glu glu ala glu asp tyr asp asp asp leu thr 
AAT AAT GAA GAA GCG GAA GAC TAT CAT GAT GAT CTT ACT 

360 

asp ser glu met asp val val arg phe asp asp asp asn 
GAT TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC 

370 

ser pro ser phe ile gin ile arg ser val ala lys lys 
TCT CCT TCC TTT ATC CAA ATT CGC TCA GTT GCC AAG AAG 

380 390 
his pro lys thr trp val his tyr ile ala ala glu glu 
CAT CCT AAA ACT TGG GTA CAT TAC ATT GCT GCT GAA GAG 

400 

glu asp trp asp tyr ala pro leu val leu ala pro asp 
GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC GCC CCC GAT 

410 

asp arg ser tyr lys ser gin tyr leu asn asn gly pro 
GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 

420 

gin arg ile gly arg lys tyr lys lys val arg phe met 
CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA TTT ATG 

430 440 

ala tyr thr asp glu thr phe lys thr arg glu ala ile 

GCA TAC ACA GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT 

450 

gin his glu ser gly ile leu gly pro leu leu tyr gly 
CAG CAT GAA TCA GGA ATC TTG GGA CCT TTA CTT TAT GGG 



FIG. 7(cont'd) 



WO 88/00831 



PCT/US87/01814 



'3/28 



460 

glu val gly asp thr leu leu ile ile phe lys asn gin 
GAA GTT GGA GAC ACA CTG TTG ATT ATA TTT AAG AAT CAA 



ala ser arg pro tyr asn ile tyr pro his gly ile thr 
GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC ACT 

490 

asp val arg pro leu tyr ser arg arg leu pro lys gly 
GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT 

500 

val lys his leu lys asp phe pro ile leu pro gly glu 
GTA AAA CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA 



ile phe lys tyr lys trp thr val thr val glu asp gly 
ATA TTC AAA TAT AAA TGG ACA GTG ACT GTA GAA GAT GGG 

530 

pro thr lys ser asp pro arg cys leu thr arg tyr tyr 
CCA ACT AAA TCA GAT CCT CGG TGC CTG ACC CGC TAT TAC 



ser ser phe val asn met glu arg asp leu ala ser gly 
TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT TCA GGA 

550 

leu ile gly pro leu leu ile cys tyr lys glu ser val 
CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA 

560 570 

asp gin arg gly asn gin ile met ser asp lys arg asn 

GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT 



470 



480 



510 



520 



540 
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580 k 
val ile leu phe ser val phe asp glu asn arg ser trp 
GTC ATC CTG TTT TCT GTA TTT GAT GAG AAC CGA AGC TGG 

pnl 590 

tyr leu thr glu asn ile gin arg phe leu pro asn pro 
TAG CTC ACA GAG AAT ATA CAA CGC TTT CTC CCC AAT CCA 



600 bamHI 610 

ala gly val gin leu glu asp pro glu phe gin ala ser 
GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC CAA GCC TCC 



asn ile met his ser ile asn 
AAC ATC ATG CAC AGC ATC AAT 

630 

leu gin leu ser val cys leu 
TTG CAG TTG TCA GTT TGT TTG 



620 

gly tyr val phe asp ser 
GGC TAT GTT TTT GAT AGT 



his glu val ala tyr trp 
CAT GAG GTG GCA TAC TGG 



640 650 
tyr ile leu ser ile gly ala gin thr asp phe leu ser 
TAC ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT 



660 

val phe phe ser gly tyr thr phe lys his lys met val 
GTC TTC TTC TCT GGA TAT ACC TTC AAA CAC AAA ATG GTC 



670 

tyr glu asp thr leu thr leu phe pro phe ser gly glu 
TAT GAA GAC ACA CTC ACC CTA TTC CCA TTC TCA GGA GAA 



680 

thr val phe met ser met glu asn pro gly leu trp ile 
ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG ATT 
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690 700 

leu gly cys his asn ser asp phe arg asn arg gly met 

CTG GGG TGC CAC AAC TCA GAC TTT *CGG AAC AGA GGC ATG 



thr ala leu leu lys val 
ACC GCC TTA CTG AAG GTT 

720 

gly asp tyr tyr glu asp 
GGT GAT TAT TAC GAG GAC 

730 

tyr leu leu ser lys asn 
TAC TTG CTG AGT AAA AAC 

II ecoRI 

phe ser gin asn ser arg 

TTC TCC CAG AAT TCA AGA 

760 

gin phe asn ala thr thr 
CAA TTT AAT GCC ACC ACA 



710 

ser ser cys asp lys asn thr 
TCT AGT TGT GAC AAG AAC ACT 



ser tyr glu asp ile ser ala 
AGT TAT GAA GAT ATT TCA GCA 

hindl 

asn ala ile glu pro arg ser 
AAT GCC ATT GAA CCA AGA AGC 

750 

his pro ser thr arg gin lys 
CAC CCT AGC ACT AGG CAA AAG 

ile pro glu asn asp ile glu 
ATT CCA GAA AAT GAC ATA GAG 



770 780 
lys thr asp pro trp phe ala his arg thr pro met pro 
AAG ACT GAC CCT TGG TTT GCA CAC AGA ACA CCT ATG CCT 



790 

lys ile gin asn val ser ser ser asp leu leu met leu 
AAA ATA CAA AAT GTC TCC TCT AGT GAT TTG TTG ATG CTC 



800 

leu arg gin ser pro thr pro his gly leu ser leu ser 
TTG CGA CAG AGT CCT ACT CCA CAT GGG CTA TCC TTA TCT 
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810 

asp leu gin glu ala lys tyr glu thr phe ser asp asp 
GAT CTC CAA GAA GCC AAA TAT GAG ACT TTT TCT GAT GAT 



820 830 

pro ser pro gly ala ile asp ser asn asn ser leu ser 

CCA TCA CCT GGA GCA ATA GAC AGT AAT AAC AGC CTG TCT 



840 

glu met thr his phe arg pro gin leu his his ser gly 
GAA ATG ACA CAC TTC AGG CCA CAG CTC CAT CAC AGT GGG 



850 

asp met val phe thr pro glu ser gly leu gin leu arg 
GAC ATG GTA TTT ACC CCT GAG TCA GGC CTC CAA TTA AGA 



860 870 
leu asn glu lys leu gly thr thr ala ala thr glu leu 
TTA AAT GAG AAA CTG GGG ACA ACT GCA GCA ACA GAG TTG 



lys lys leu asp phe lys val 
AAG AAA CTT GAT TTC AAA GTT 

890 

leu ile ser thr ile pro ser 
CTG ATT TCA ACA ATT CCA TCA 



880 

ser ser thr ser asn asn 
TCT AGT ACA TCA AAT AAT 



asp asn leu ala ala gly 
GAC AAT TTG GCA GCA GGT 



900 910 
thr asp asn thr ser ser leu gly pro pro ser met pro 
ACT GAT AAT ACA AGT TCC TTA GGA CCC CCA AGT ATG CCA 

920 

val his tyr asp ser gin leu asp thr thr leu phe gly 
GTT CAT TAT GAT AGT CAA TTA GAT ACC ACT CTA TTT GGC 
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930 

lys lys ser ser pro leu thr glu ser gly gly pro leu 
AAA AAG TCA TCT CCC CTT ACT GAG TCT GGT GGA CCT CTG 



940 

ser leu ser glu glu asn asn 
AGC TTG AGT GAA GAA AAT AAT 

950 

ser gly leu met asn ser gin 
TCA GGT TTA ATG AAT AGC CAA 

asn val ser ser thr glu ser 
AAT GTA TCG TCA ACA GAG AGT 

sad 980 
lys arg ala his gly pro ala 
AAA AGA GCT CAT GGA CCT GCT 



asp ser lys lea, leu glu 
GAT TCA AACk TTC/.TTA GAA . 

960. 

glu se^ *s-eir,:tcp: v giL7 Lys:. . f - 
GAA AGTT TCA'iTGGX'GGA AAA • . 

970 

gly arg leu phe lys gly 
GGT AGG TTA TTT AAA GGG 

leu leu thr lys asp asn 
TTG TTG ACT AAA GAT AAT 



990 1000 
ala leu phe lys val ser ile ser leu leu lys thr asn 
GCC TTA TTC AAA GTT AGC ATC TCT TTG TTA AAG ACA AAC 



1010 

lys thr ser asn asn ser ala thr asn arg lys thr his 
AAA ACT TCC AAT AAT TCA GCA ACT AAT AGA AAG ACT CAC 



1020 

ile asp gly pro ser leu leu ile glu asn ser pro ser 
ATT GAT GGC CCA TCA TTA TTA ATT GAG AAT AGT CCA TCA 

1030 1040 
val trp gin asn ile leu glu ser asp thr glu phe lys 
GTC TGG CAA AAT ATA TTA GAA AGT GAC ACT GAG TTT AAA 
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1050 

lys val thr pro leu ile his asp arg met leu met asp 
AAA GTG ACA CCT TTG ATT CAT GAC AGA ATG CTT ATG GAC 

1060 

lys asn ala thr ala leu arg leu asn his met ser asn 
AAA AAT GCT ACA GCT TTG AGG CTA AAT CAT ATG TCA AAT 

1070 

lys thr thr ser ser lys asn met glu met val gin gin 
AAA ACT ACT TCA TCA AAA AAC ATG GAA ATG GTC CAA CAG 

1080 1090 

lys lys glu gly pro ile pro pro asp ala gin asn pro 

AAA AAA GAG GGC CCC ATT CCA CCA GAT GCA CAA AAT CCA 

1100 

asp met ser phe phe lys met leu phe leu pro glu ser 
GAT ATG TCG TTC TTT AAG ATG CTA TTC TTG CCA GAA TCA 

1110 

ala arg trp ile gin arg thr his gly lys asn ser leu 
GCA AGG TGG ATA CAA AGG ACT CAT GGA AAG AAC TCT CTG 

1120 1130 
asn ser gly gin gly pro ser pro lys gin leu val ser 
AAC TCT GGG CAA GGC CCC AGT CCA AAG CAA TTA GTA TCC 

1140 

leu gly pro glu lys ser val glu gly gin asn phe leu 
TTA GGA CCA GAA AAA TCT GTG GAA GGT CAG AAT TTC TTG 

1150 

ser glu lys asn lys val val val gly lys gly glu phe 
TCT GAG AAA AAC AAA GTG GTA GTA GGA AAG GGT GAA TTT 
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1160 1170 

thr lys asp val gly leu lys glu met val phe pro ser 

ACA AAG GAC GTA GGA CTC AAA GAG ArTG GTT TTT CCA AGC 

1180 

ser arg asn leu phe leu thr asn leu asp asn leu his 
AGC AGA AAC CTA TTT CTT ACT AAC TTG GAT AAT TTA CAT 

1190 

glu asn asn thr his asn gin glu lys lys ile gin glu 
GAA AAT AAT ACA CAC AAT CAA GAA AAA AAA ATT CAG GAA 

1200 

glu ile glu lys lys glu thr leu ile gin glu asn val 
GAA ATA GAA AAG AAG GAA ACA TTA ATC CAA GAG AAT GTA 

1210 1220 

val leu pro gin ile his thr val thr gly thr lys asn 

GTT TTG CCT CAG ATA CAT ACA GTG ACT GGC ACT AAG AAT 

1230 

phe met lys asn leu phe leu leu ser thr arg gin asn 
TTC ATG AAG AAC CTT TTC TTA CTG AGC ACT AGG CAA AAT 

1240 seal 
val glu gly ser tyr asp gly ala tyr ala pro val leu 
GTA GAA GGT TCA TAT GAC GGC GCA TAT GCT CCA GTA CTT 

1250 1260 
gin asp phe arg ser leu asn asp ser thr asn arg thr 
CAA GAT TTT AGG TCA TTA AAT GAT TCA ACA AAT AGA ACA 

1270 

lys lys his thr ala his phe ser lys lys gly glu glu 

AAG AAA CAC ACA GCT CAT TTC TCA AAA AAA GGG GAG GAA 
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1280 

glu asn leu glu gly leu gly asn gin thr lys gin ile 
GAA AAC TTG GAA GGC TTG GGA AAT €AA ACC AAG CAA ATT 

1290 sphl 1300 
val glu lys tyr ala cys thr thr arg ile ser pro asn 
GTA GAG AAA TAT GCA TGC ACC ACA AGG ATA TCT CCT AAT 

1310 

thr ser gin gin asn phe val thr gin arg ser lys arg 
ACA AGC CAG CAG AAT TTT GTC ACG CAA CGT AGT AAG AGA 

1320 

ala leu lys gin phe arg leu pro leu glu glu thr glu 
GCT TTG AAA CAA TTC AGA CTC CCA CTA GAA GAA ACA GAA 

1330 

leu glu lys arg ile ile val asp asp thr ser thr gin 
CTT GAA AAA AGG ATA ATT GTG GAT GAC ACC TCA ACC CAG 

1340 1350 

trp ser lys asn met lys his leu thr pro ser thr leu 

TGG TCC AAA AAC ATG AAA CAT TTG ACC CCG AGC ACC CTC 

1360 

thr gin ile asp tyr asn glu lys glu lys gly ala ile 
ACA CAG ATA GAC TAC AAT GAG AAG GAG AAA GGG GCC ATT 

1370 

thr gin ser pro leu ser asp cys deu thr arg ser his 
ACT CAG TCT CCC TTA TCA GAT TCG CTT ACG AGG AGT CAT 

1380 1390 
ser ile pro gin ala asn arg ser pro leu pro ile ala 
AGC ATC CCT CAA GCA AAT AGA TCT CCA TTA CCC ATT GCA 
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1400 

lys val ser ser phe pro ser ile arg pro ile tyr leu 
AAG GTA TCA TCA TTT CCA TCT ATT AGA CCT ATA TAT CTG 



thr arg val leu phe gin 
ACC AGG GTC CTA TTC CAA 

1420 

ala ala ser tyr arg lys 
GCA GCA TCT TAT AGA AAG 



asp asn ser ser his leu pro 
GAC AAC TCT TCT CAT CTT CCA 

1430 

lys asp ser gly val gin glu 
AAA GAT TCT GGG GTC CAA GAA 



1440 

ser ser his phe leu gin gly ala lys lys asn asn leu 
AGC AGT CAT TTC TTA CAA GGA GCC AAA AAA AAT AAC CTT 



ser leu ala ile leu thr 
TCT TTA GCC ATT CTA ACC 



1450 

leu glu met thr gly asp gin 
TTG GAG ATG ACT GGT GAT CAA 



1460 

arg glu val gly ser leu gly thr ser ala thr asn ser 
AGA GAG GTT GGC TCC CTG GGG ACA AGT GCC ACA AAT TCA 



1470 1480 

val thr tyr lys lys val glu asn thr val leu pro lys 

GTC ACA TAC AAG AAA GTT GAG AAC ACT GTT CTC CCG AAA 



1490 

pro asp leu pro lys thr ser gly lys val glu leu leu 
CCA GAC TTG CCC AAA ACA TCT GGC AAA GTT GAA TTG CTT 



1500 

pro lys val his ile tyr gin lys asp leu phe pro thr 
CCA AAA GTT CAC ATT TAT CAG AAG GAC CTA TTC CCT ACG 
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1510 1520 
glu thr ser asn gly ser pro gly his leu asp leu val 
GAA ACT AGC AAT GGG TCT CCT GGC CAT CTG GAT CTC GTG 

1530 

glu gly ser leu leu gin gly thr glu gly ala ile lys 
GAA GGG AGC CTT CTT CAG GGA ACA GAG GGA GCG ATT AAG 

1540 

trp asn glu ala asn arg pro gly lys val pro phe leu 
TGG AAT GAA GCA AAC AGA CCT GGA AAA GTT CCC TTT CTG 

1550 1560 
arg val ala thr glu ser ser ala lys thr pro ser lys 

AGA GTA GCA ACA GAA AGC TCT GCA AAG ACT CCC TCC AAG 

bamHI 1570 
leu leu asp pro leu ala trp asp asn his tyr gly thr 
CTA TTG GAT CCT CTT GCT TGG GAT AAC CAC TAT GGT ACT 

1580 

gin ile pro lys glu glu trp lys ser gin glu lys ser 
CAG ATA CCA AAA GAA GAG TGG AAA TCC CAA GAG AAG TCA 

1590 

pro glu lys thr ala phe lys lys lys asp thr ile leu 
CCA GAA AAA ACA GCT TTT AAG AAA AAG GAT ACC ATT TTG 

1600 1610 

ser leu asn ala cys glu ser asn his ala ile ala ala 

TCC CTG AAC GCT TGT GAA AGC AAT CAT GCA ATA GCA GCA 

1620 

ile asn glu gly gin asn lys pro glu ile glu val thr 
ATA AAT GAG GGA CAA AAT AAG CCC GAA ATA GAA GTC ACC 
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1630 

trp ala lys gin gly arg thr glu arg leu cys ser gin 
TGG CCA AAG CAA GGT AGG ACT GAA AGG CTG TGC TCT CAA 

1640 1650 
asn pro pro val leu lys arg his gin arg glu ile thr 
AAC CCA CCA GTC TTG AAA CGC CAT CAA CGG GAA ATA ACT 

1660 

arg thr thr leu gin ser asp gin glu glu ile asp tyr 
CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT GAC TAT 

1670 

asp asp thr ile ser val glu met lys lys glu asp phe 
GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA GAT TTT 

1680 1690 
asp ile tyr asp glu asp glu asn gin ser pro arg ser 
GAC ATT TAT GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC 

1700 

phe gin lys lys thr arg his tyr phe ile ala ala val 
TTT CAA AAG AAA ACA CGA CAC TAT TTT ATT GCT GCA GTG 

1710 

glu arg leu trp asp tyr gly met ser ser ser pro his 
GAG AGG CTC TGG GAT TAT GGG ATG AGT AGC TCC CCA CAT 

1720 

val leu arg asn arg ala gin ser gly ser val pro gin 
GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT GTC CCT CAG 

1730 1740 

phe lys lys val val phe gin glu phe thr asp gly ser 

TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC 
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1750 

phe thr gin pro leu tyr arg gly glu leu asn glu his 
TTT ACT CAG CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT 

1760 

leu gly leu leu gly pro tyr ile arg ala glu val glu 
TTG GGA CTC CTG GGG CCA TAT ATA AGA GCA GAA GTT GAA 

1770 1780 
asp asn ile met val thr phe arg asn gin ala ser arg 
GAT AAT ATC ATG GTA ACT TTC AGA AAT CAG GCC TCT CGT 

1790 

pro tyr ser phe tyr ser ser leu ile ser tyr glu glu 
CCC TAT TCC TTC TAT TCT AGC CTT ATT TCT TAT GAG GAA 

1800 

asp gin arg gin gly ala glu pro arg lys asn phe val 
GAT CAG AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT GTC 

1810 1820 
lys pro asn glu thr lys thr tyr phe trp lys val gin 
AAG CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG CAA 

1830 

his his met ala pro thr lys asp glu phe asp cys lys 
CAT CAT ATG GCA CCC ACT AAA GAT GAG TTT GAC TGC AAA 

1840 

ala trp ala tyr phe ser asp val asp leu glu lys asp 
GCC TGG GCT TAT TTC TCT GAT GTT GAC CTG GAA AAA GAT 

1850 

val his ser gly leu ile gly pro leu leu val cys his 
GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG GTC TGC CAC 
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I860 1870 n fc 

thr asn thr leu asn pro ala his gly arg gin val thr 

ACT AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA GTG ACA 

1880 

val gin glu phe ala leu phe phe thr ile phe asp glu 
GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTT GAT GAG 

1890 

thr lys ser trp tyr phe thr glu asn met glu arg asn 
ACC AAA AGC TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC 

1900 1910 
cys arg ala pro cys asn ile gin met glu asp pro thr 
TGC AGG GCT CCC TGC AAT ATC CAG ATG GAA GAT CCC ACT 

1920 

phe lys glu asn tyr arg phe his ala ile asn gly tyr 
TTT AAA GAG AAT TAT CGC TTC CAT GCA ATC AAT GGC TAC 

1930 

ile met asp thr leu pro gly leu val met ala gin asp 
ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG GCT CAG GAT 

1940 1950 
gin arg ile arg trp tyr leu leu ser met gly ser asn 
CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT 

1960 

glu asn ile his ser ile his phe ser gly his val phe 
GAA AAC ATC CAT TCT ATT CAT TTC AGT GGA CAT GTG TTC 

1970 

thr val arg lys lys glu glu tyr lys met ala leu tyr 

ACT GTA CGA AAA AAA GAG GAG TAT AAA ATG GCA CTG TAC 
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1980 

asn leu tyr pro gly val phe glu thr val glu met leu 
AAT CTC TAT CCA GGT GTT TTT GAG ACA GTG GAA ATG TTA 

1990 2000 

pro ser lys ala gly ile trp arg val glu cys leu ile 

CCA TCC AAA GCT GGA ATT TGG CGG GTG GAA TGC CTT ATT 

2010 

gly glu his leu his ala gly met ser thr leu phe leu 
GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT CTG 

2020 

val tyr ser asn lys cys gin thr pro leu gly met ala 
GTG TAC AGC AAT AAG TGT CAG ACT CCC CTG GGA ATG GCT 

2030" 2040 
ser gly his ile arg asp phe gin ile thr ala ser gly 
TCT GGA CAC ATT AGA GAT TTT CAG ATT ACA GCT TCA GGA 

2050 

gin tyr gly gin trp ala pro lys leu ala arg leu his 
CAA TAT GGA CAG TGG GCC CCA AAG CTG GCC AGA CTT CAT 

2060 

tyr ser gly ser ile asn ala trp ser thr lys glu pro 
TAT TCC GGA TCA ATC AAT GCC TGG AGC ACC AAG GAG CCC 

2070 2080 
phe ser trp ile lys val asp leu leu ala pro met ile 
TTT TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA ATG ATT 

2090 

ile his gly ile lys thr gin gly ala arg gin lys phe 
ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG AAG TTC 
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2100 

ser ser leu tyr ile ser gin phe ile ile met tyr ser 
TCC AGC CTC TAC ATC TCT CAG TTT ATC ATC ATG TAT AGT 

2110 

leu asp gly lys lys trp gin thr tyr arg gly asn ser 
CTT GAT GGG AAG AAG TGG CAG ACT TAT CGA GGA AAT TCC 

2120 2130 

thr gly- thr leu met val phe phe gly asn val asp ser 

ACT GGA ACC TTA ATG GTC TTC TTT GGC AAT GTG GAT TCA 

2140 

ser gly ile lys his asn ile phe asn pro pro ile ile 
TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT CCA ATT ATT 

2150 

ala arg tyr ile arg leu his pro thr his tyr ser ile 
GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT 

2160 2170 
arg ser thr leu arg met glu leu met gly cys asp leu 
CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA 

sphl 2180 
asn ser cys ser met pro leu gly met glu ser lys ala 
AAT AGT TGC AGC ATG CCA TTG GGA ATG GAG AGT AAA GCA 

2190 

ile ser asp ala gin ile thr ala ser ser tyr phe thr 
ATA TCA GAT GCA CAG ATT ACT GCT TCA TCC TAC TTT ACC 

2200 2210 
asn met phe ala thr trp ser pro ser lys ala arg leu 
AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA GCT CGA CTT 
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2220 



his leu gin gly arg ser asn ala trp arg pro gin val 
CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG GTG 



asn asn pro lys glu txp leu gin val asp phe gin lys 
AAT AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG AAG 

2240 

thr met lys val thr gly val thr thr gin gly val lys 
ACA ATG AAA GTC ACA GGA GTA ACT ACT CAG GGA GTA AAA 

2250 2260 

ser leu leu thr ser met tyr val lys glu phe leu ile 

TCT CTG CTT ACC AGC ATG TAT GTG AAG GAG TTC CTC ATC 

2270 

ser ser ser gin asp gly his gin trp thr leu phe phe 
TCC AGC AGT CAA GAT GGC CAT CAG TGG ACT CTC TTT TTT 

2280 

gin asn gly lys val lys val phe gin gly asn gin asp 
CAG AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT CAA GAC 

2290 2300 
ser phe thr pro val val asn ser leu asp pro pro leu 
TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC CCA CCG TTA 



leu thr arg tyr leu arg ile his pro gin ser trp val 
CTG ACT CGC TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG 

2320 

his gin ile ala leu arg met glu val leu gly cys glu 
CAC CAG ATT GCC CTG AGG ATG GAG GTT CTG GGC TGC GAG 

2330 2332 

ala gin asp leu tyr OP , 



2230 



ecoRI 



2310 



GCA CAG 



GAC CTC TAC TGA 
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